Virtual Datasets #8708

Open · frcroth wants to merge 48 commits into master from explore-virtual-datasets

Conversation

@frcroth (Member) commented Jun 23, 2025

URL of deployed dev instance (used for testing):

  • https://___.webknossos.xyz

Steps to test:

  • Upload dataset
  • Explore remote dataset
  • View datasets
  • Create, update and save annotations
  • Update datasets in WK
  • Delete datasets
  • Dataset composition
  • Zarr streaming
  • Zarr streaming via Legacy Routes
  • Make changes to the datasource JSON and check that they are registered in the DB

Implementation notes

The DataSource id is still used internally in the datastore for various caches, as well as in the binary data handling.
Real on-disk data sources are still handled via organizationId and datasetDirectoryName (e.g., for uploading and storage-size reporting).
Everything else should use dataset ids.
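
For illustration, a minimal sketch of the two addressing schemes described above. Names and signatures are hypothetical stand-ins, not the actual datastore API:

```scala
// Hypothetical sketch: legacy on-disk addressing keeps
// (organizationId, datasetDirectoryName), while all other lookups
// (caches, routes, access checks) go through the dataset's ObjectId.
import java.nio.file.{Path, Paths}

final case class DataSourceId(organizationId: String, directoryName: String) // legacy, disk-backed
final case class ObjectId(value: String)                                     // canonical dataset id
final case class DataSource(id: ObjectId, name: String)                      // stub for illustration

trait DatasetAddressing {
  // Uploading and storage-size reporting still resolve a physical path:
  def diskPathFor(dataSourceId: DataSourceId): Path =
    Paths.get(dataSourceId.organizationId, dataSourceId.directoryName)

  // Everything else is keyed by the dataset id:
  def lookup(datasetId: ObjectId): Option[DataSource]
}
```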

TODOs:

  • WKW datasets can be accessed via the wkDataset routes
  • Frontend uses datasetId routes
  • Old routes are removed / use the datasetId routes
  • Dataset Composition should work
    • Compose frontend
  • "Delete dataset on disk" does not necessarily make sense anymore
    • Dataset deletion should set deleted in DB and remove on disk if something is there.
  • Test upload
  • Test compose (or rather debug)
  • Add an "isVirtual" column to datasets?
  • Refactor: Surely we do not need toAbstractLayer and AbstractLayer.from
  • Make WKW inherit DataLayerWithMagLocators?
  • Frontend schema of datasource properties does not match backend schema (wkwResolutions expected for wkw datasets)
  • Test Remote fallback layer
  • remove lengthOfUnderlyingCubes
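
A minimal sketch of the deletion flow proposed in the list above. The signatures are hypothetical stand-ins (the PR adds an existsOnDisk method to DatasetDeleter, per the change summary below):

```scala
// Hypothetical sketch: always set the deleted flag in the DB,
// then remove on-disk data only if any exists (virtual datasets have none).
import scala.concurrent.{ExecutionContext, Future}

trait DatasetDAO { def markDeleted(datasetId: String): Future[Unit] } // hypothetical
trait DatasetDeleter {
  def existsOnDisk(datasetId: String): Boolean
  def deleteOnDisk(datasetId: String): Future[Unit]
}

class DatasetDeletionService(dao: DatasetDAO, deleter: DatasetDeleter)(implicit ec: ExecutionContext) {
  def delete(datasetId: String): Future[Unit] =
    for {
      _ <- dao.markDeleted(datasetId)
      _ <- if (deleter.existsOnDisk(datasetId)) deleter.deleteOnDisk(datasetId)
           else Future.unit
    } yield ()
}
```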

Issues:

Followups:

  • Readable URIs for zarr routes
  • Test remote datasets that are recreated when refreshing schema.
  • Protocol Changes
    • Deletion in WK -> WK asks datastore to delete on disk if not virtual
    • Do we need to call register datasets at the datastore? The datastore does not need to be involved in creation of virtual datasets

(Please delete unneeded items, merge only when none are left open)

  • Added changelog entry (create a $PR_NUMBER.md file in unreleased_changes or use ./tools/create-changelog-entry.py)
  • Added migration guide entry if applicable (edit the same file as for the changelog)
  • Updated documentation if applicable
  • Adapted wk-libs python client if relevant API parts change
  • Removed dev-only changes like prints and application.conf edits
  • Considered common edge cases
  • Needs datastore update after deployment

coderabbitai bot (Contributor) commented Jun 23, 2025

📝 Walkthrough

This change implements virtual datasets by standardizing dataset identification to use a single dataset ID (ObjectId) across backend, frontend, and API boundaries. It introduces support for virtual datasets, refactors controller and service method signatures, updates database schema and migrations, adds new endpoints, and removes legacy or redundant code. Documentation and type definitions are updated accordingly.
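
For illustration, the shape of the signature change (simplified; the real controllers return Play actions):

```scala
// Simplified sketch of the addressing change: controllers that previously took
// (organizationId, datasetDirectoryName) now take a single datasetId (ObjectId).
final case class ObjectId(value: String)

trait BinaryDataRoutesBefore {
  def requestRawCuboid(organizationId: String, datasetDirectoryName: String, dataLayerName: String): Array[Byte]
}

trait BinaryDataRoutesAfter {
  def requestRawCuboid(datasetId: ObjectId, dataLayerName: String): Array[Byte]
}
```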

Changes

  • app/controllers/*.scala, app/models/annotation/AnnotationService.scala, app/models/dataset/*: Refactor to use datasetId (ObjectId) for dataset identification; add virtual dataset support; update method signatures, add/modify endpoints, enhance access validation, support dataset composition, and update service logic.
  • app/models/dataset/explore/WKExploreRemoteLayerService.scala: Rename and refactor the method for adding remote data sources to use virtual datasets and datasetId.
  • conf/webknossos.latest.routes, conf/webknossos.versioned.routes, conf/evolutions/*, tools/postgres/schema.sql: Update API routes to use datasetId; add new endpoints; add a migration for the isVirtual column; update the schema version.
  • docs/data/concepts.md: Update documentation to use mags instead of wkwResolutions for magnifications; clarify metadata structure.
  • frontend/javascripts/admin/api/mesh.ts, frontend/javascripts/admin/rest_api.ts, frontend/javascripts/types/api_types.ts, frontend/javascripts/types/schemas/*: Refactor API calls and types to use datasetId; update function signatures and remove dataSourceId usage; update schemas for mags and WKW layers.
  • frontend/javascripts/dashboard/dataset/*, frontend/javascripts/dashboard/advanced_dataset/*: Update dataset settings, deletion, and configuration views to use datasetId; refactor prop and argument usage; update validation logic.
  • frontend/javascripts/test/backend-snapshot-tests/*, frontend/javascripts/test/model/binary/*: Update tests to use datasetId in API requests and mocks.
  • frontend/javascripts/viewer/model/*, frontend/javascripts/viewer/view/*: Refactor viewer data access, mesh, and segment-index logic to use datasetId; update saga and helper functions.
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/*: Refactor controllers to accept datasetId (ObjectId) instead of organizationId/datasetDirectoryName; update logic for data access, Zarr streaming, meshes, and data source management.
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala: Refactor WKW layer classes to use mags instead of wkwResolutions; update JSON serialization for backward compatibility.
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/helpers/DatasetDeleter.scala: Add an existsOnDisk method; update deleteOnDisk and symlink logic to use datasetId.
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala: Extend DataLayerWithMagLocators to handle WKW layers; update the mapped and getMags methods.
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/*: Add ZarrStreamingService; update DatasetCache, AccessTokenService, DSRemoteWebknossosClient, DSFullMeshService, and DataSourceService for datasetId usage and virtual dataset support.
  • webknossos-datastore/conf/datastore.latest.routes, datastore.versioned.routes: Update route definitions to use datasetId; add the v10 API version; refactor or remove legacy routes.
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/*: Refactor tracing-store clients and helpers to use datasetId; update cache keying, method signatures, and fallback-layer logic.
  • webknossos-tracingstore/conf/tracingstore.versioned.routes: Add the v10 API version route.
  • unreleased_changes/8708.md: Note the removal of remote dataset directory creation and the addition of a migration script.
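
The WKWDataLayers entry above mentions backward-compatible JSON serialization for the wkwResolutions-to-mags rename. A minimal sketch of that pattern with play-json (the case classes are simplified stand-ins, not the actual layer types):

```scala
// Accept the new `mags` field, and fall back to parsing legacy `wkwResolutions`.
import play.api.libs.json._

final case class MagEntry(mag: List[Int], path: Option[String])
object MagEntry {
  implicit val format: OFormat[MagEntry] = Json.format[MagEntry]
}

final case class WkwLayerMags(mags: List[MagEntry])
object WkwLayerMags {
  private val currentReads: Reads[WkwLayerMags] =
    (JsPath \ "mags").read[List[MagEntry]].map(WkwLayerMags.apply)

  // Legacy entries carry a `resolution` (plus a WKW-specific cubeLength); map them onto mags.
  private val legacyReads: Reads[WkwLayerMags] =
    (JsPath \ "wkwResolutions").read[List[JsObject]].map { entries =>
      WkwLayerMags(entries.flatMap(e => (e \ "resolution").asOpt[List[Int]].map(MagEntry(_, None))))
    }

  implicit val reads: Reads[WkwLayerMags] = currentReads.orElse(legacyReads)
}
```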

Estimated code review effort

5 (~180 minutes)

Suggested labels

frontend, enhancement

Suggested reviewers

  • MichaelBuessemeyer

Poem

🐇
Hop, hop, the datasets grow,
From disk to virtual, IDs in tow!
No more folders, no more fuss,
Just ObjectIds—come join us!
Layers, mags, and Zarr in sync,
This bunny loves a tidy link!
🥕


@frcroth force-pushed the explore-virtual-datasets branch from f7de002 to 7dcad67 on June 23, 2025 14:22
Base automatically changed from virtual-remote-datasets to master on June 25, 2025 07:26
@frcroth force-pushed the explore-virtual-datasets branch from 7dcad67 to 42101a9 on June 25, 2025 07:54
@frcroth changed the title from "Exploring remote datasets creates virtual dataset" to "Virtual Datasets" on Jul 9, 2025
@MichaelBuessemeyer mentioned this pull request on Jul 21, 2025
@frcroth marked this pull request as ready for review on July 21, 2025 13:05
coderabbitai bot (Contributor) left a comment

Actionable comments posted: 2

🔭 Outside diff range comments (1)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (1)

130-179: Remove unused method

The convertLayerToZarrLayer method appears to be dead code since this logic has been moved to ZarrStreamingService. This method is not called anywhere in the controller.

-  private def convertLayerToZarrLayer(layer: DataLayer, zarrVersion: Int): ZarrLayer = {
-    val dataFormat = if (zarrVersion == 2) DataFormat.zarr else DataFormat.zarr3
-    layer match {
-      case s: SegmentationLayer =>
-        val rank = s.additionalAxes.map(_.length).getOrElse(0) + 4 // We're writing c, additionalAxes, xyz
-        ZarrSegmentationLayer(
-          s.name,
-          s.boundingBox,
-          s.elementClass,
-          mags = s.sortedMags.map(
-            m =>
-              MagLocator(m,
-                         Some(s"./${s.name}/${m.toMagLiteral(allowScalar = true)}"),
-                         None,
-                         Some(AxisOrder.cAdditionalxyz(rank)),
-                         None,
-                         None)),
-          mappings = s.mappings,
-          largestSegmentId = s.largestSegmentId,
-          numChannels = Some(if (s.elementClass == ElementClass.uint24) 3 else 1),
-          defaultViewConfiguration = s.defaultViewConfiguration,
-          adminViewConfiguration = s.adminViewConfiguration,
-          coordinateTransformations = s.coordinateTransformations,
-          additionalAxes = s.additionalAxes.map(reorderAdditionalAxes),
-          dataFormat = dataFormat
-        )
-      case d: DataLayer =>
-        val rank = d.additionalAxes.map(_.length).getOrElse(0) + 4 // We're writing c, additionalAxes, xyz
-        ZarrDataLayer(
-          d.name,
-          d.category,
-          d.boundingBox,
-          d.elementClass,
-          mags = d.sortedMags.map(
-            m =>
-              MagLocator(m,
-                         Some(s"./${d.name}/${m.toMagLiteral(allowScalar = true)}"),
-                         None,
-                         Some(AxisOrder.cAdditionalxyz(rank)),
-                         None,
-                         None)),
-          numChannels = Some(if (d.elementClass == ElementClass.uint24) 3 else 1),
-          defaultViewConfiguration = d.defaultViewConfiguration,
-          adminViewConfiguration = d.adminViewConfiguration,
-          coordinateTransformations = d.coordinateTransformations,
-          additionalAxes = d.additionalAxes.map(reorderAdditionalAxes),
-          dataFormat = dataFormat
-        )
-    }
-  }
🧹 Nitpick comments (6)
docs/data/concepts.md (1)

125-127: Minor formatting: Consider using consistent list markers.

The static analysis tools flag inconsistent list marker usage. Consider using dashes consistently:

-  + `dataLayers.mags`: Holds information about the available magnification steps of the layer.
-    * `dataLayers.mags.mag`: A 3-tuple (e.g., `[1, 1, 1]`, `[2, 2, 1]`) for uniform or non-uniform magnifications.
-    * `dataLayers.mags.path`: The path to the directory containing the data for this magnification step.
+  - `dataLayers.mags`: Holds information about the available magnification steps of the layer.
+    - `dataLayers.mags.mag`: A 3-tuple (e.g., `[1, 1, 1]`, `[2, 2, 1]`) for uniform or non-uniform magnifications.
+    - `dataLayers.mags.path`: The path to the directory containing the data for this magnification step.
app/models/dataset/ComposeService.scala (1)

73-73: Consider supporting additional layer types.

Currently only DataLayerWithMagLocators is supported. Consider documenting why other layer types are excluded or plan for future support.

-        case _ => Fox.failure("Unsupported layer type for composition: " + layer.getClass.getSimpleName)
+        case _ => Fox.failure(s"Unsupported layer type for composition: ${layer.getClass.getSimpleName}. Only DataLayerWithMagLocators is currently supported.")
app/models/dataset/DatasetService.scala (1)

291-295: Assess performance of getVirtualDatasetIds()

The DAO method loads all virtual dataset IDs into memory in one go. If you have a large number of virtual datasets, this query and the in-memory list could become a bottleneck. Consider whether you need to:

  • Stream results instead of calling .toList (e.g., Slick streaming or an FS2/Alpakka approach).
  • Add pagination or a row-limit to the query.
  • Cache the ID list if it changes infrequently.

Attention needed at:

  • app/models/dataset/Dataset.scala: lines 522–529 (definition of getVirtualDatasetIds)
  • app/models/dataset/DatasetService.scala: line 292 (where it’s invoked)
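
If caching turns out to be the right trade-off, a minimal sketch of a TTL cache around the ID list (hypothetical names, not the existing DAO API):

```scala
// Re-run the DAO query only after the TTL expires, instead of on every call.
import java.time.Instant
import scala.concurrent.{ExecutionContext, Future}

class VirtualDatasetIdCache(ttlSeconds: Long, loadIds: () => Future[List[String]])(
    implicit ec: ExecutionContext) {
  @volatile private var cached: Option[(Instant, List[String])] = None

  def get: Future[List[String]] = cached match {
    case Some((loadedAt, ids)) if Instant.now.isBefore(loadedAt.plusSeconds(ttlSeconds)) =>
      Future.successful(ids)
    case _ =>
      loadIds().map { ids =>
        cached = Some((Instant.now, ids))
        ids
      }
  }
}
```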
frontend/javascripts/admin/rest_api.ts (2)

1107-1112: Consider removing legacy formatting

The comment indicates this is formatting data for the "old format" that the backend can parse, and mentions removing the datasetId. This suggests a transitional implementation. Consider creating a follow-up task to fully migrate the backend to the new format.


1365-1370: Redundant parameters in function call

The function passes both dataset and dataset.id to triggerDatasetClearCache. Since the dataset object contains the ID, consider refactoring to pass only the necessary parameter.

-    triggerDatasetClearCache(dataset.dataStore.url, dataset, dataset.id, layerName),
+    triggerDatasetClearCache(dataset.dataStore.url, dataset.id, layerName),
webknossos-datastore/conf/datastore.latest.routes (1)

110-123: Mixed parameter patterns for backward compatibility

Some routes still use organizationId and datasetName parameters while others use datasetId. This mixed approach appears intentional for:

  • Organization-level operations (measureUsedStorage)
  • Dataset creation (add)
  • Legacy support (reload trigger)

Consider documenting which routes are legacy vs. new pattern for maintainability.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3f49365 and 4bef4a8.

⛔ Files ignored due to path filters (1)
  • test/db/dataSets.csv is excluded by !**/*.csv
📒 Files selected for processing (59)
  • app/controllers/AnnotationIOController.scala (2 hunks)
  • app/controllers/DatasetController.scala (3 hunks)
  • app/controllers/UserTokenController.scala (1 hunks)
  • app/controllers/WKRemoteDataStoreController.scala (3 hunks)
  • app/controllers/WKRemoteTracingStoreController.scala (2 hunks)
  • app/models/annotation/AnnotationService.scala (4 hunks)
  • app/models/dataset/ComposeService.scala (1 hunks)
  • app/models/dataset/Dataset.scala (7 hunks)
  • app/models/dataset/DatasetService.scala (10 hunks)
  • app/models/dataset/WKRemoteDataStoreClient.scala (5 hunks)
  • app/models/dataset/explore/WKExploreRemoteLayerService.scala (1 hunks)
  • conf/evolutions/136-virtual-datasets.sql (1 hunks)
  • conf/evolutions/reversions/136-virtual-datasets.sql (1 hunks)
  • conf/webknossos.latest.routes (3 hunks)
  • conf/webknossos.versioned.routes (2 hunks)
  • docs/data/concepts.md (6 hunks)
  • frontend/javascripts/admin/api/mesh.ts (4 hunks)
  • frontend/javascripts/admin/dataset/composition_wizard/04_configure_new_dataset.tsx (1 hunks)
  • frontend/javascripts/admin/rest_api.ts (20 hunks)
  • frontend/javascripts/dashboard/dataset/dataset_settings_data_tab.tsx (0 hunks)
  • frontend/javascripts/dashboard/dataset/dataset_settings_delete_tab.tsx (1 hunks)
  • frontend/javascripts/dashboard/dataset/dataset_settings_view.tsx (2 hunks)
  • frontend/javascripts/dashboard/dataset/dataset_settings_viewconfig_tab.tsx (3 hunks)
  • frontend/javascripts/test/backend-snapshot-tests/datasets.e2e.ts (3 hunks)
  • frontend/javascripts/test/model/binary/layers/wkstore_adapter.spec.ts (5 hunks)
  • frontend/javascripts/types/schemas/datasource.schema.ts (2 hunks)
  • frontend/javascripts/types/schemas/datasource.types.ts (1 hunks)
  • frontend/javascripts/viewer/model/bucket_data_handling/wkstore_adapter.ts (1 hunks)
  • frontend/javascripts/viewer/model/sagas/load_histogram_data_saga.ts (1 hunks)
  • frontend/javascripts/viewer/model/sagas/meshes/precomputed_mesh_saga.ts (2 hunks)
  • frontend/javascripts/viewer/view/right-border-tabs/connectome_tab/connectome_view.tsx (2 hunks)
  • frontend/javascripts/viewer/view/right-border-tabs/segments_tab/segments_view_helper.tsx (1 hunks)
  • tools/postgres/schema.sql (2 hunks)
  • unreleased_changes/8708.md (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/DataStoreModule.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/BinaryDataController.scala (10 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DSMeshController.scala (6 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (18 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/WKDatasetController.scala (0 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (12 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (5 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/helpers/DatasetDeleter.scala (4 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala (2 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/AccessTokenService.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DSRemoteWebknossosClient.scala (4 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DataSourceService.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetCache.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/ZarrStreamingService.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/DSFullMeshService.scala (3 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/ComposeService.scala (0 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/UploadService.scala (2 hunks)
  • webknossos-datastore/conf/datastore.latest.routes (4 hunks)
  • webknossos-datastore/conf/datastore.versioned.routes (1 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (2 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (3 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (2 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/volume/VolumeTracingService.scala (1 hunks)
  • webknossos-tracingstore/conf/tracingstore.versioned.routes (1 hunks)
💤 Files with no reviewable changes (3)
  • frontend/javascripts/dashboard/dataset/dataset_settings_data_tab.tsx
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/WKDatasetController.scala
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/ComposeService.scala
🧰 Additional context used
🧠 Learnings (49)
frontend/javascripts/viewer/model/sagas/load_histogram_data_saga.ts (1)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

frontend/javascripts/test/model/binary/layers/wkstore_adapter.spec.ts (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetCache.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
frontend/javascripts/dashboard/dataset/dataset_settings_delete_tab.tsx (1)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

frontend/javascripts/viewer/view/right-border-tabs/segments_tab/segments_view_helper.tsx (4)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: philippotto
PR: #8602
File: frontend/javascripts/oxalis/model/volumetracing/volume_annotation_sampling.ts:365-366
Timestamp: 2025-05-07T06:17:32.810Z
Learning: The parameter in applyVoxelMap was renamed from sliceCount to sliceOffset to better reflect its purpose, but this doesn't affect existing call sites since JavaScript/TypeScript function calls are position-based.

Learnt from: MichaelBuessemeyer
PR: #8485
File: frontend/javascripts/oxalis/model/accessors/dataset_layer_transformation_accessor.ts:384-392
Timestamp: 2025-04-01T09:45:17.527Z
Learning: The function isRotationAndMirrorMaybeOnly in the dataset_layer_transformation_accessor.ts is intentionally designed to allow mirroring transformations (negative scale values). It uses the length comparison (scale.length() === NON_SCALED_VECTOR.length()) rather than component equality to permit mirrored axes while ensuring the overall scale magnitude remains the same.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/UploadService.scala (1)

Learnt from: frcroth
PR: #8202
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetErrorLoggingService.scala:68-72
Timestamp: 2024-11-25T10:02:03.702Z
Learning: In DatasetErrorLoggingService.scala, prefer using TextUtils.stackTraceAsString(exception) when logging exceptions instead of passing the exception directly to logger.error.

frontend/javascripts/admin/dataset/composition_wizard/04_configure_new_dataset.tsx (1)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

frontend/javascripts/viewer/view/right-border-tabs/connectome_tab/connectome_view.tsx (1)

Learnt from: philippotto
PR: #8542
File: frontend/javascripts/viewer/model/sagas/volumetracing_saga.tsx:0-0
Timestamp: 2025-05-30T12:36:09.930Z
Learning: The WebKnossos frontend follows Redux immutability principles where reducers act immutably and create new state objects when changes occur. This makes reference equality checks (like prevVolumeTracing.segments !== volumeTracing.segments) reliable and efficient for detecting actual state changes, allowing memoization to work correctly.

frontend/javascripts/viewer/model/sagas/meshes/precomputed_mesh_saga.ts (5)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: philippotto
PR: #8602
File: frontend/javascripts/oxalis/model/volumetracing/volume_annotation_sampling.ts:365-366
Timestamp: 2025-05-07T06:17:32.810Z
Learning: The parameter in applyVoxelMap was renamed from sliceCount to sliceOffset to better reflect its purpose, but this doesn't affect existing call sites since JavaScript/TypeScript function calls are position-based.

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/NeuroglancerMeshHelper.scala:143-152
Timestamp: 2025-04-30T08:54:11.252Z
Learning: In Neuroglancer's sharded format, mesh fragment data is located immediately before the manifest file in the same shard data file. The correct way to calculate the mesh data starting offset is to subtract the total mesh size from the manifest starting offset (neuroglancerOffsetStart - totalMeshSize), not by adding the manifest size.

frontend/javascripts/dashboard/dataset/dataset_settings_view.tsx (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/volume/VolumeTracingService.scala (4)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

tools/postgres/schema.sql (1)

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/AccessTokenService.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala (4)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

frontend/javascripts/types/schemas/datasource.types.ts (1)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

app/controllers/UserTokenController.scala (4)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/NeuroglancerPrecomputedMeshFileService.scala:161-166
Timestamp: 2025-04-28T14:18:04.368Z
Learning: In Scala for-comprehensions with the Fox error handling monad, Fox.fromBool() expressions should use the <- binding operator instead of the = assignment operator to properly propagate error conditions. Using = will cause validation failures to be silently ignored.

frontend/javascripts/test/backend-snapshot-tests/datasets.e2e.ts (1)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

app/controllers/AnnotationIOController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
frontend/javascripts/viewer/model/bucket_data_handling/wkstore_adapter.ts (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
app/controllers/WKRemoteTracingStoreController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (3)

Learnt from: frcroth
PR: #8202
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetErrorLoggingService.scala:68-72
Timestamp: 2024-11-25T10:02:03.702Z
Learning: In DatasetErrorLoggingService.scala, prefer using TextUtils.stackTraceAsString(exception) when logging exceptions instead of passing the exception directly to logger.error.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

app/models/dataset/explore/WKExploreRemoteLayerService.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
app/controllers/DatasetController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
docs/data/concepts.md (8)

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: MichaelBuessemeyer
PR: #8221
File: CHANGELOG.unreleased.md:14-14
Timestamp: 2024-11-25T14:52:23.201Z
Learning: For the webknossos project, changelog entries in CHANGELOG.unreleased.md should be concise and avoid unnecessary detailed explanations.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/reducers/skeletontracing_reducer_helpers.ts:125-125
Timestamp: 2024-11-22T17:17:39.914Z
Learning: In the createNode function within skeletontracing_reducer_helpers.ts, the property resolution is intentionally kept even when the parameter is renamed to mag.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

Learnt from: frcroth
PR: #8598
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DatasetLayerAttachments.scala:89-95
Timestamp: 2025-06-02T09:49:51.047Z
Learning: In WebKnossos dataset layer attachments, multiple file types can safely use the same directory name (like "agglomerates") because the scanning logic filters by file extension. For example, AgglomerateFileInfo scans for .hdf5 files while CumsumFileInfo scans for .json files in the same "agglomerates" directory without interference.

frontend/javascripts/admin/api/mesh.ts (3)

Learnt from: philippotto
PR: #8602
File: frontend/javascripts/oxalis/model/volumetracing/volume_annotation_sampling.ts:365-366
Timestamp: 2025-05-07T06:17:32.810Z
Learning: The parameter in applyVoxelMap was renamed from sliceCount to sliceOffset to better reflect its purpose, but this doesn't affect existing call sites since JavaScript/TypeScript function calls are position-based.

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

frontend/javascripts/dashboard/dataset/dataset_settings_viewconfig_tab.tsx (2)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
app/models/dataset/ComposeService.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (1)

Learnt from: MichaelBuessemeyer
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/datareaders/precomputed/NeuroglancerPrecomputedShardingUtils.scala:23-28
Timestamp: 2025-04-25T11:06:13.275Z
Learning: AlfuCache in the WebKnossos codebase has default configured TTL (time-to-live) and size limits, providing automatic protection against unbounded memory growth.

frontend/javascripts/types/schemas/datasource.schema.ts (6)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/reducers/skeletontracing_reducer_helpers.ts:125-125
Timestamp: 2024-11-22T17:17:39.914Z
Learning: In the createNode function within skeletontracing_reducer_helpers.ts, the property resolution is intentionally kept even when the parameter is renamed to mag.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

app/models/dataset/WKRemoteDataStoreClient.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
app/models/dataset/DatasetService.scala (5)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/NeuroglancerPrecomputedMeshFileService.scala:161-166
Timestamp: 2025-04-28T14:18:04.368Z
Learning: In Scala for-comprehensions with the Fox error handling monad, Fox.fromBool() expressions should use the <- binding operator instead of the = assignment operator to properly propagate error conditions. Using = will cause validation failures to be silently ignored.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

app/models/annotation/AnnotationService.scala (1)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (1)

Learnt from: MichaelBuessemeyer
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/datareaders/precomputed/NeuroglancerPrecomputedShardingUtils.scala:23-28
Timestamp: 2025-04-25T11:06:13.275Z
Learning: AlfuCache in the WebKnossos codebase has default configured TTL (time-to-live) and size limits, providing automatic protection against unbounded memory growth.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (2)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/reducers/skeletontracing_reducer_helpers.ts:125-125
Timestamp: 2024-11-22T17:17:39.914Z
Learning: In the createNode function within skeletontracing_reducer_helpers.ts, the property resolution is intentionally kept even when the parameter is renamed to mag.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/BinaryDataController.scala (1)

Learnt from: MichaelBuessemeyer
PR: #8325
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/image/ImageCreator.scala:0-0
Timestamp: 2025-02-24T08:22:51.814Z
Learning: In ImageCreator.scala, precision loss during uint64 to double conversion is acceptable because the values are only used for image generation where each color component is ultimately represented as a single byte in ARGB format. For grayscale values (except uint24 which represents RGB), each color only uses 8 bits anyway.

webknossos-datastore/conf/datastore.latest.routes (1)

Learnt from: frcroth
PR: #8598
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DatasetLayerAttachments.scala:89-95
Timestamp: 2025-06-02T09:49:51.047Z
Learning: In WebKnossos dataset layer attachments, multiple file types can safely use the same directory name (like "agglomerates") because the scanning logic filters by file extension. For example, AgglomerateFileInfo scans for .hdf5 files while CumsumFileInfo scans for .json files in the same "agglomerates" directory without interference.

frontend/javascripts/admin/rest_api.ts (1)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

conf/webknossos.versioned.routes (1)

Learnt from: frcroth
PR: #8202
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetErrorLoggingService.scala:68-72
Timestamp: 2024-11-25T10:02:03.702Z
Learning: In DatasetErrorLoggingService.scala, prefer using TextUtils.stackTraceAsString(exception) when logging exceptions instead of passing the exception directly to logger.error.

🧬 Code Graph Analysis (6)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetCache.scala (2)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (1)
  • invalidateCache (685-690)
app/models/job/Job.scala (1)
  • datasetId (55-55)
frontend/javascripts/dashboard/dataset/dataset_settings_delete_tab.tsx (1)
frontend/javascripts/admin/rest_api.ts (1)
  • deleteDatasetOnDisk (1347-1357)
frontend/javascripts/admin/dataset/composition_wizard/04_configure_new_dataset.tsx (1)
frontend/javascripts/admin/rest_api.ts (1)
  • createDatasetComposition (1104-1122)
frontend/javascripts/viewer/view/right-border-tabs/connectome_tab/connectome_view.tsx (1)
frontend/javascripts/types/api_types.ts (1)
  • APIDataset (242-245)
frontend/javascripts/dashboard/dataset/dataset_settings_view.tsx (1)
frontend/javascripts/admin/rest_api.ts (1)
  • updateDatasetDatasource (984-995)
app/models/annotation/AnnotationService.scala (2)
app/models/dataset/WKRemoteDataStoreClient.scala (1)
  • hasSegmentIndexFile (80-90)
util/src/main/scala/com/scalableminds/util/objectid/ObjectId.scala (1)
  • toString (12-12)
🪛 LanguageTool
docs/data/concepts.md

[style] ~104-~104: For conciseness, consider replacing this expression with an adverb.
Context: ...ynonymously for mags throughout the UI. At the moment, WebKnossos guarantees correct renderin...

(AT_THE_MOMENT)

🪛 markdownlint-cli2 (0.17.2)
docs/data/concepts.md

125-125: Unordered list style
Expected: dash; Actual: plus

(MD004, ul-style)


126-126: Unordered list style
Expected: dash; Actual: asterisk

(MD004, ul-style)


127-127: Unordered list style
Expected: dash; Actual: asterisk

(MD004, ul-style)

🔇 Additional comments (115)
unreleased_changes/8708.md (1)

1-6: LGTM! Well-structured changelog entry.

The changelog appropriately documents the key behavioral changes and references the required database evolution script for deployment tracking.

webknossos-tracingstore/conf/tracingstore.versioned.routes (1)

3-3: LGTM! Proper API versioning for breaking changes.

Adding the v10 route prefix is appropriate for the dataset identification refactor and follows the established versioning pattern.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/DataStoreModule.scala (1)

67-67: LGTM! Proper dependency injection binding.

The eager singleton binding for ZarrStreamingService follows the established pattern and is necessary for the service to be available for injection.

frontend/javascripts/viewer/model/sagas/load_histogram_data_saga.ts (1)

40-40: LGTM! Consistent with dataset ID refactor.

The change from passing the full dataset object to dataset.id aligns with the broader refactor to unify dataset identification using dataset IDs instead of composite identifiers.

frontend/javascripts/dashboard/dataset/dataset_settings_delete_tab.tsx (1)

45-45: LGTM! Simplified API call with unified dataset identification.

The change from constructing a separate dataSourceId object to passing the full dataset object simplifies the API call and aligns with the dataset ID-based identification refactor throughout the codebase.

tools/postgres/schema.sql (2)

24-24: Schema version increment is appropriate.

The version bump to 136 correctly reflects the structural change to the datasets table.


110-110: Well-designed virtual dataset column addition.

The isVirtual boolean column is properly structured with:

  • NOT NULL constraint for data integrity
  • DEFAULT FALSE ensuring backward compatibility
  • Logical placement within the datasets table schema

This aligns perfectly with the PR's virtual dataset functionality.

frontend/javascripts/test/model/binary/layers/wkstore_adapter.spec.ts (2)

39-39: Correct addition of dataset ID to mock data.

The mock dataset state properly includes the new id field, ensuring tests reflect the updated dataset identification scheme.


162-162: Consistent URL pattern updates in test assertions.

All test expectations have been systematically updated from the previous organization/datasetPath pattern to use datasetId, correctly reflecting the new unified dataset identification approach.

Also applies to: 166-166, 198-198, 218-218, 237-237

frontend/javascripts/viewer/view/right-border-tabs/segments_tab/segments_view_helper.tsx (1)

79-79: Correct update to unified dataset identification.

The change from dataset.directoryName to dataset.id properly aligns with the system-wide refactor to use dataset IDs as the primary identifier, and the removal of the organization parameter matches the updated hasSegmentIndexInDataStore API signature.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetCache.scala (1)

29-29: Improved method signature with direct ObjectId parameter.

The change from String to ObjectId parameter is excellent because it:

  • Eliminates potential conversion errors
  • Provides better type safety
  • Aligns with controller usage patterns (as seen in DataSourceController.scala)
  • Maintains consistency with the cache's ObjectId key type
conf/webknossos.versioned.routes (2)

6-7: Clear documentation of API version changes.

The version log entry accurately describes the dataset identification changes in v10, providing helpful context for API consumers.


18-18: Proper API versioning implementation.

The v10 route is correctly:

  • Placed above v9 for proper routing priority
  • Mapped to webknossos.latest.Routes following established patterns
  • Supporting the unified dataset ID approach described in the PR objectives
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/volume/VolumeTracingService.scala (1)

977-979: LGTM! Clean migration to dataset ID-based identification.

The changes correctly implement the transition from using dataSource.id to datasetId for RemoteFallbackLayer creation. The approach maintains the existing functionality while aligning with the broader codebase refactor to unified dataset identification.

frontend/javascripts/viewer/view/right-border-tabs/connectome_tab/connectome_view.tsx (2)

15-15: Import statement correctly updated for dataset ID refactoring.

The removal of APIDataSourceId from imports aligns with the transition to unified dataset identification using APIDataset.


353-353: Type annotation correctly updated to use APIDataset.

The change from APIDataSourceId to APIDataset in the fetchProperties tuple type is consistent with the codebase-wide migration to dataset ID-based identification and matches the actual usage on line 355 where dataset is passed.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/UploadService.scala (2)

444-444: LGTM: Clean API adaptation for optional dataset ID parameter.

The explicit None arguments align with the updated deleteOnDisk method signature that now accepts an optional datasetId: Option[ObjectId] parameter. This maintains the existing cleanup behavior while supporting the new unified dataset identification approach.

Also applies to: 452-452


499-499: LGTM: Method rename clarifies on-disk operations.

The change from updateDataSource to updateDataSourceOnDisk provides better clarity about the method's purpose and distinguishes it from remote operations, aligning with the broader refactor.

frontend/javascripts/viewer/model/sagas/meshes/precomputed_mesh_saga.ts (1)

287-287: LGTM: Parameter updates align with unified dataset identification.

The changes from passing the full dataset object to just dataset.id align with the backend refactor to use unified dataset IDs instead of composite organization/dataset name identifiers. The mesh API functions now correctly receive the dataset ID parameter they expect.

Also applies to: 367-367

conf/evolutions/reversions/136-virtual-datasets.sql (1)

1-12: LGTM: Well-structured database reversion script.

The migration properly:

  • Validates the current schema version (136)
  • Drops the isVirtual column from the datasets table
  • Recreates the datasets_ view to maintain filtering on non-deleted datasets
  • Updates the schema version back to 135
  • Uses transaction boundaries for atomicity

The script correctly handles the reversion of virtual dataset schema changes.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DataSourceService.scala (1)

272-272: LGTM: Method rename improves clarity.

The rename from updateDataSource to updateDataSourceOnDisk better communicates the method's purpose of persisting changes to disk, distinguishing it from in-memory or remote operations. This aligns with the broader refactor and improves code readability.

frontend/javascripts/admin/dataset/composition_wizard/04_configure_new_dataset.tsx (1)

171-178: LGTM: API call simplification aligns with backend changes.

The removal of the explicit datastore URL argument simplifies the call to match the updated createDatasetComposition API signature. The function now correctly receives only the dataset composition parameters object, aligning with the backend's unified dataset identification approach.

conf/evolutions/136-virtual-datasets.sql (1)

1-12: LGTM! Well-structured database migration.

The migration script correctly adds support for virtual datasets by adding an isVirtual boolean column with proper defaults and constraints. The transaction boundaries, schema version validation, and view recreation are all handled appropriately.

frontend/javascripts/types/schemas/datasource.types.ts (1)

17-19: Good refactoring to use intersection types.

The change from an explicit object type to an intersection type with BaseRemoteLayer reduces code duplication and improves maintainability. The boundingBox property is now inherited from BaseRemoteLayer, which is more consistent with the other data layer types.

frontend/javascripts/viewer/model/bucket_data_handling/wkstore_adapter.ts (1)

100-105: LGTM! Simplified URL construction with unified dataset ID.

The change from using separate dataset.directoryName and dataset.owningOrganization to a single dataset.id simplifies the URL construction and aligns with the system-wide migration to unified dataset identification.

frontend/javascripts/test/backend-snapshot-tests/datasets.e2e.ts (4)

99-107: Good addition of dynamic dataset ID resolution.

The helper function getTestDatasetId improves test robustness by dynamically resolving dataset IDs instead of relying on hardcoded values. The error handling for missing datasets is appropriate.


110-123: LGTM! Updated test to use dynamic dataset ID.

The Zarr streaming test correctly uses the resolved dataset ID in template literals, making it more maintainable and aligned with the unified dataset identification approach.


125-145: LGTM! Updated test to use dynamic dataset ID.

The Zarr 3 streaming test correctly uses the resolved dataset ID in template literals, consistent with the refactoring approach.


147-177: LGTM! Updated test to use dynamic dataset ID.

The Dataset Paths test correctly uses the resolved dataset ID, maintaining consistency with the system-wide migration to unified dataset identification.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala (2)

493-508: LGTM! Added WKW layer support to DataLayerWithMagLocators trait.

The addition of WKWDataLayer and WKWSegmentationLayer cases to the mapped method follows the same pattern as other layer types, properly applying transformations and returning updated copies. This integrates WKW layers into the unified data layer framework.


522-523: LGTM! Added WKW layer support to getMags method.

The addition of WKWDataLayer and WKWSegmentationLayer cases to the getMags method is consistent with other layer types and properly returns the mags list. This completes the integration of WKW layers into the DataLayerWithMagLocators trait.
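The shape of the added cases, with heavily simplified stand-in types (the real trait operates on MagLocator values and the full layer case classes):

sealed trait Layer
case class WKWDataLayer(name: String, mags: List[String]) extends Layer
case class WKWSegmentationLayer(name: String, mags: List[String]) extends Layer

def mapped(layer: Layer, magMapping: String => String): Layer = layer match {
  case l: WKWDataLayer         => l.copy(mags = l.mags.map(magMapping)) // new case
  case l: WKWSegmentationLayer => l.copy(mags = l.mags.map(magMapping)) // new case
}

def getMags(layer: Layer): List[String] = layer match {
  case l: WKWDataLayer         => l.mags
  case l: WKWSegmentationLayer => l.mags
}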

app/controllers/UserTokenController.scala (1)

185-198: LGTM! Consistent implementation of dataset delete access control.

The implementation follows the established pattern from handleDataSourceAccess and includes all necessary validations:

  • Configuration check for delete feature enablement
  • Dataset existence validation with global access context
  • User authentication and organization/admin permission checks

The match expression properly handles the new delete access mode.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/AccessTokenService.scala (1)

46-56: LGTM! Clean addition of ObjectId-based access request methods.

The overloaded methods provide a clean transition to dataset ID-centric access control by converting ObjectId to string and reusing the existing DataSourceId construction pattern. This maintains consistency with the broader refactoring to use dataset IDs throughout the codebase.
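A plausible shape of the overloads, simplified (the real mode is an AccessMode value and the dataset id an ObjectId):

case class DataSourceId(directoryName: String, organizationId: String)
case class UserAccessRequest(dataSourceId: DataSourceId, mode: String)

object UserAccessRequest {
  def readDataSource(dataSourceId: DataSourceId): UserAccessRequest =
    UserAccessRequest(dataSourceId, "read")
  // new overload: the ObjectId is stringified into the existing id shape
  def readDataset(datasetId: String): UserAccessRequest =
    readDataSource(DataSourceId(datasetId, ""))
}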

app/controllers/DatasetController.scala (3)

92-92: LGTM! ComposeService dependency injection aligns with virtual dataset functionality.

The addition of ComposeService to the constructor properly enables the new dataset composition capabilities.


149-153: LGTM! Method name change reflects shift to database-based dataset creation.

The change from addRemoteDatasource to addRemoteDatasourceToDatabase indicates a transition from RPC-based remote calls to direct database operations for virtual dataset creation, which aligns with the virtual dataset implementation.


494-500: LGTM! Clean implementation of dataset composition endpoint.

The compose endpoint properly:

  • Uses secured action with JSON validation
  • Delegates to ComposeService.composeDataset
  • Returns the new dataset ID in a structured JSON response
  • Includes appropriate error handling with custom error key

The implementation follows established controller patterns in the codebase.

frontend/javascripts/dashboard/dataset/dataset_settings_view.tsx (2)

369-369: LGTM! Parameter update aligns with backend dataset ID refactoring.

The change correctly updates the updateDatasetDatasource call to:

  • Use dataset.id instead of dataset.directoryName
  • Match the new parameter order (dataStoreUrl, dataSource, datasetId)

This aligns with the backend's shift from directory-name-based to ID-based dataset identification.


586-586: LGTM! Passing full dataset object provides richer context.

Changing from dataSourceId={maybeDataSourceId} to dataset={this.state.dataset} provides the DatasetSettingsViewConfigTab component with the complete dataset object rather than partial identifier information, enabling more comprehensive dataset handling.

docs/data/concepts.md (3)

26-26: LGTM! Terminology update aligns with codebase refactoring.

The change from "resolutions" to "mags" consistently reflects the backend refactoring throughout the WEBKNOSSOS codebase that replaced wkwResolutions with mags for magnification steps.


72-94: LGTM! JSON metadata example properly reflects new mags structure.

The updated JSON example correctly shows:

  • mags array replacing wkwResolutions
  • mag field (3-tuple array) instead of resolution
  • path field for data location
  • Consistent structure across both color and segmentation layers

This aligns with the backend schema changes for WKW data layers.


103-105: LGTM! Clear explanation of new mags structure.

The documentation properly explains that mag is always a 3-element array and path specifies data location. The terminology update from "resolutions" to "mags" is consistent throughout.

app/models/dataset/explore/WKExploreRemoteLayerService.scala (1)

108-124: Method refactoring looks good with improved consistency.

The rename from addRemoteDatasource to addRemoteDatasourceToDatabase better reflects the method's purpose. The elimination of organization lookup and token generation simplifies the logic while maintaining functionality through the new datasetService.createVirtualDataset approach.

The parameter changes are consistent with the broader dataset ID unification effort.

app/controllers/AnnotationIOController.scala (2)

340-342: Parameter simplification aligns with dataset ID unification.

The change from organizationId and datasetName parameters to a single datasetId string parameter simplifies the method signature and aligns with the broader refactoring effort to use dataset IDs consistently across the codebase.


365-365: Consistent parameter usage in RPC call.

The call to remoteDataStoreClient.hasSegmentIndexFile correctly uses the new datasetId parameter, maintaining consistency with the updated method signature.

frontend/javascripts/dashboard/dataset/dataset_settings_viewconfig_tab.tsx (3)

36-40: Props change improves component design.

Changing from dataSourceId to accepting the full dataset object provides better context and enables more comprehensive validation. The prop typing with | null | undefined properly handles loading states.


46-100: Comprehensive refactoring with proper error handling.

The validateDefaultMappings function refactoring is well-implemented:

  • Proper null check for dataset prevents runtime errors
  • API calls updated to use dataset object consistently
  • Error handling maintains user-friendly messages
  • Caching mechanism preserved

The async validation logic correctly handles the dataset dependency.


312-312: Consistent parameter passing.

The call to validateDefaultMappings correctly passes the dataset prop instead of the previous dataSourceId, maintaining consistency with the updated function signature.

app/controllers/WKRemoteTracingStoreController.scala (2)

124-133: Method rename and simplification improve clarity.

The rename from dataSourceIdForAnnotation to datasetIdForAnnotation better reflects the returned value. The simplified logic that directly returns dataset._id.toString eliminates unnecessary organization lookups while maintaining the required functionality.


149-158: Direct dataset ID usage simplifies method logic.

The dataStoreUriForDataset method now accepts datasetId: ObjectId directly, eliminating the need for organization-based fallback logic. This streamlines the code while maintaining the same functionality through direct dataset lookup.

app/models/annotation/AnnotationService.scala (4)

121-124: Method signature update aligns with dataset ID unification.

The addition of a datasetId: ObjectId parameter to createVolumeTracing is consistent with the broader refactoring effort to use dataset IDs throughout the codebase instead of separate organization and dataset name components.


142-142: RPC call parameter correctly updated.

The call to remoteDatastoreClient.hasSegmentIndexFile correctly uses datasetId.toString and layer.name, consistent with the updated client method signature shown in the relevant code snippets.


241-241: Call site properly updated with new parameter.

The call to createVolumeTracing in createTracingForExplorational correctly passes dataset._id for the new datasetId parameter, maintaining consistency with the updated method signature.


435-435: Additional call site correctly updated.

The call to createVolumeTracing in createVolumeTracingBase properly passes the datasetId parameter, ensuring all call sites are consistently updated with the new method signature.

frontend/javascripts/types/schemas/datasource.schema.ts (1)

52-87: Schema changes look consistent and well-structured.

The renaming of wkwResolutions to mags and resolution to mag aligns with the broader refactor to unify dataset identification. The addition of new properties (path, credentials, axisOrder) with proper typing and the updated required fields maintain schema consistency across all data layer formats.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (4)

13-13: Good cleanup: Removed unused DataSourceId import.

The import simplification removes unused DataSourceId while retaining the necessary DataSourceLike, which aligns with the refactor to use dataset IDs instead of composite data source identifiers.


53-53: Cache type simplification improves consistency.

Changing the cache value type from DataSourceId to String simplifies the caching mechanism and aligns with the unified dataset identification approach using dataset IDs.


71-75: Method signature and endpoint updates are consistent.

The refactor from separate organizationId and datasetDirectoryName parameters to a single datasetId parameter, along with the corresponding RPC endpoint path update, properly implements the unified dataset identification approach.


77-86: Method rename and type changes align with refactor goals.

The rename from getDataSourceIdForAnnotation to getDatasetIdForAnnotation with the return type change from Fox[DataSourceId] to Fox[String] consistently implements the shift to dataset ID-based identification throughout the system.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (3)

18-18: Case class simplification improves the data model.

The refactor from separate organizationId and datasetDirectoryName fields to a single datasetId string, alongside layerName and elementClass, aligns with the unified dataset identification approach and simplifies the data structure.


21-24: Method rename and parameter updates are consistent.

The rename from fromDataLayerAndDataSource to fromDataLayerAndDatasetId with the parameter change from DataSourceId to datasetId string properly implements the shift to dataset ID-based identification throughout the system.


37-38: FallbackDataHelper updates maintain functionality.

The update to use getDatasetIdForAnnotation and construct RemoteFallbackLayer with the unified datasetId maintains the same functionality while adopting the new dataset identification scheme.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DSRemoteWebknossosClient.scala (4)

53-58: New case class is well-structured.

The DataSourceRegistrationInfo case class properly encapsulates dataSource and folderId with appropriate JSON formatting, supporting the new dataset registration workflow.
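Sketched with play-json, assuming an implicit format for DataSource is in scope and that folderId is a plain string (both assumptions):

import play.api.libs.json.{Json, OFormat}

case class DataSourceRegistrationInfo(dataSource: DataSource, folderId: Option[String])

object DataSourceRegistrationInfo {
  // Json.format derives Reads and Writes from the member formats
  implicit val jsonFormat: OFormat[DataSourceRegistrationInfo] = Json.format[DataSourceRegistrationInfo]
}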


130-133: Method parameter update aligns with unified identification.

The fetchPaths method update to accept datasetId: ObjectId instead of composite identifiers, with corresponding RPC URL construction, properly implements the dataset ID-based approach.


144-162: New dataset management methods are well-implemented.

The new registerDataSource and updateDataSource methods properly implement dataset ID-based RPC calls with appropriate token context handling and consistent URL construction patterns.


168-172: Virtual dataset deletion method follows consistent pattern.

The deleteVirtualDataset method properly implements the deletion workflow using dataset ID with appropriate RPC call structure and token handling.

app/models/dataset/WKRemoteDataStoreClient.scala (3)

24-25: Cache key simplification improves efficiency.

The cache key change from (String, String, String) to (String, String) removes the organization component, simplifying the caching mechanism while maintaining the necessary (datasetId, layerName) identification.


37-37: RPC URL construction consistently uses dataset ID.

All dataset-related RPC calls have been properly updated to use dataset._id in URL construction instead of separate organization and dataset name components, implementing the unified dataset identification approach throughout the client.

Also applies to: 60-60, 67-67, 85-85, 102-102


80-90: Method parameter update maintains functionality.

The hasSegmentIndexFile method update to accept datasetId: String instead of separate organization and dataset parameters, with corresponding cache key and RPC URL updates, properly implements the unified dataset identification while preserving the caching behavior.

frontend/javascripts/admin/api/mesh.ts (2)

29-66: LGTM! Correctly implements the dataset ID migration.

The parameter change from composite identifiers to a single datasetId string and the corresponding URL update properly align with the system-wide migration to unified dataset identification.


79-87: LGTM! Consistent with the dataset ID unification.

The changes mirror those in getMeshfileChunksForSegment, maintaining consistency in the API.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/helpers/DatasetDeleter.scala (3)

25-31: Good addition for checking dataset existence on disk.

The existsOnDisk method properly handles both regular and conversion directory paths, which is useful for the virtual dataset implementation.


85-89: Clean refactoring to use dataset ID directly.

The changes to moveSymlinks properly align with the dataset ID unification by using the ID directly for fetching paths instead of constructing a DataSourceId.


76-76: Symlink handling for dataset deletion is correct

All regular deletion flows supply a defined datasetId, so moveSymlinks will run where needed, and virtual or in-conversion datasets legitimately skip symlink operations:

  • In DataSourceController.deleteOnDisk, calls use Some(datasetId), invoking moveSymlinks for real datasets
  • UploadService.deleteOnDisk passes None only on failed uploads (no symlinks to move)
  • WKRemoteDataStoreController only allows deletion of virtual datasets and never provides an ID, so symlinks are rightly untouched

No changes required.

app/models/dataset/Dataset.scala (4)

65-65: Proper implementation of the isVirtual field.

The addition of the isVirtual boolean field is correctly integrated throughout the Dataset model, including parsing, insertion, and the case class definition.

Also applies to: 149-149, 625-625, 634-634


522-530: LGTM! Useful method for retrieving virtual datasets.

The getVirtualDatasetIds method provides a clean way to query all virtual datasets, which will be helpful for management and maintenance tasks.


693-693: Correct exclusion of virtual datasets from deactivation.

Virtual datasets should not be deactivated when unreported by the datastore since they don't have physical storage.


777-779: Path column migration confirmed

Verified that conf/evolutions/126-mag-real-paths.sql includes:

ALTER TABLE webknossos.dataset_mags ADD COLUMN path TEXT;

and tools/postgres/schema.sql defines path TEXT on webknossos.dataset_mags. No further action needed.

webknossos-datastore/conf/datastore.versioned.routes (3)

3-3: Good API versioning strategy.

Adding v10 for the new dataset ID-based routes while preserving v9 maintains backward compatibility.


6-44: Comprehensive legacy route preservation.

The explicit definition of all v9 routes mapped to the LegacyController ensures backward compatibility for existing integrations. The routes properly maintain the old parameter structure with organizationId and datasetDirectoryName.


16-17: Confirm intentional unversioned legacy routes

I didn’t find any other unversioned /datasets/... entries in webknossos-datastore/conf/datastore.versioned.routes. The only two routes missing the /v9/ prefix are:

  • Line 16:
    POST /datasets/:organizationId/:datasetDirectoryName/layers/:dataLayerName/adHocMesh @LegacyController.requestAdHocMesh(...)
  • Line 17:
    GET /datasets/:organizationId/:datasetDirectoryName/layers/:dataLayerName/mappings/:mappingName @LegacyController.mappingJson(...)

Please verify whether these should remain unversioned for backward compatibility. If not, update them to include the /v9/ prefix for consistency.

app/models/dataset/ComposeService.scala (2)

40-53: Well-structured dataset composition implementation.

The composeDataset method properly validates composability, creates the datasource, and generates a virtual dataset. Good use of the Fox monad for error handling.


63-67: Correct handling of coordinate transformations.

The transformation concatenation logic properly appends new transformations to existing ones, preserving the transformation chain.
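The append semantics, reduced to a minimal sketch (transformations are plain strings here for brevity):

case class Layer(coordinateTransformations: Option[List[String]])

def withAddedTransformations(layer: Layer, newTransformations: List[String]): Layer = {
  val existing = layer.coordinateTransformations.getOrElse(List.empty)
  val combined = existing ++ newTransformations // append to the chain, never replace it
  layer.copy(coordinateTransformations = if (combined.isEmpty) None else Some(combined))
}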

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/DSFullMeshService.scala (1)

55-62: LGTM! Clean refactoring to pass objects directly.

The signature changes from separate identifiers to DataSource and DataLayer objects improve performance by eliminating redundant repository lookups.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (1)

39-39: Cache key type simplified correctly.

The change from composite tuple keys to single dataset ID strings aligns with the unified identification scheme.

app/controllers/WKRemoteDataStoreController.scala (2)

245-255: Virtual dataset deletion endpoint implemented correctly.

The endpoint properly validates the dataset ID format, checks that the dataset exists and is virtual before marking it as deleted. Good error handling with appropriate HTTP status codes.


286-309: Virtual dataset registration endpoint well-implemented.

Good security with user token validation, organization membership check, and folder access verification. The endpoint properly creates virtual datasets through the DatasetService.

app/models/dataset/DatasetService.scala (1)

103-124: Virtual dataset creation method well-structured.

Excellent validation flow including dataset name validation, uniqueness check, folder access control, and proper user association. The implementation follows established patterns.

conf/webknossos.latest.routes (2)

96-96: Dataset composition route added correctly.

The new /datasets/compose endpoint enables virtual dataset creation from multiple datasets as intended.


112-115: Dataset management routes properly updated.

Routes correctly transitioned to use ObjectId dataset IDs and new virtual dataset endpoints added with appropriate paths.

Also applies to: 121-121

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DSMeshController.scala (2)

4-4: Consistent refactoring to unified dataset identification

The refactoring from composite identifiers (organizationId + datasetDirectoryName) to a single ObjectId is well-executed throughout the controller. All methods follow a consistent pattern of using datasetCache.getWithLayer for data retrieval and UserAccessRequest.readDataset(datasetId) for access control.

Also applies to: 23-23, 33-96


93-93: Improved service method signature

Good architectural improvement - passing DataSource and DataLayer objects directly to fullMeshService.loadFor eliminates redundant repository lookups within the service.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (3)

23-23: Trait inheritance change aligns with unified magnification handling

The change from DataLayer to DataLayerWithMagLocators properly integrates WKW layers with the unified magnification system.


60-97: Well-designed backward-compatible JSON serialization

The custom JSON formats elegantly handle the migration from wkwResolutions to mags by:

  1. Attempting to read the new mags field first
  2. Falling back to legacy wkwResolutions with automatic conversion
  3. Providing clear error messages when neither field is present

This ensures smooth migration without breaking existing clients.

Also applies to: 119-158
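The fallback pattern in play-json terms, with simplified field types; the real format also copes with scalar resolutions and richer error messages:

import play.api.libs.json._

case class MagEntry(mag: List[Int], cubeLength: Option[Int])
implicit val magEntryFormat: OFormat[MagEntry] = Json.format[MagEntry]

val magsWithFallbackReads: Reads[List[MagEntry]] = Reads { json =>
  (json \ "mags").validate[List[MagEntry]].orElse( // try the new field first
    (json \ "wkwResolutions").validate[List[JsObject]].map(_.map { res =>
      // legacy entries carry `resolution` and `cubeLength`; convert on the fly
      MagEntry((res \ "resolution").as[List[Int]], (res \ "cubeLength").asOpt[Int])
    })
  )
}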


34-34: defaultCubeSize is only a legacy fallback; real WKW cubeLength comes from the DB

I’ve confirmed that for WKW datasets the wkwResolutionsOpt field is populated from the dataset_mags table (via the DatasetMagsDAO.updateMags logic), and that WKWDataLayers uses those database-provided cubeLength values. The hardcoded defaultCubeSize = 1024 only applies in the final fallback branch when no wkwResolutionsOpt exists (i.e. legacy data). No change is required.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/ZarrStreamingService.scala (4)

1-29: Well-structured Zarr streaming service

Good architectural decision to extract Zarr-specific logic into a dedicated service. The service properly:

  • Supports both Zarr v2 and v3 formats
  • Centralizes format conversion and header generation
  • Uses appropriate dependency injection

63-112: Correct Zarr layer conversion implementation

The layer conversion properly:

  • Constructs Zarr-compliant paths
  • Sets appropriate axis ordering (c + additional + xyz)
  • Handles the uint24 special case for RGB data (3 channels)
  • Maintains all layer properties during conversion

114-146: Robust Zarr cube data retrieval

The implementation correctly:

  • Parses multi-dimensional coordinates with additional axes support
  • Validates magnification values
  • Properly scales coordinates by cube size and magnification
  • Handles missing chunks with appropriate error messages
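The scaling step in isolation, as a sketch (the real implementation also folds additional coordinate axes into the position):

case class Vec3Int(x: Int, y: Int, z: Int)

def topLeftVoxel(chunkCoord: Vec3Int, cubeSize: Int, mag: Vec3Int): Vec3Int =
  Vec3Int(
    chunkCoord.x * cubeSize * mag.x, // per-axis: chunk index * chunk edge length * mag
    chunkCoord.y * cubeSize * mag.y,
    chunkCoord.z * cubeSize * mag.z
  )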

170-223: Comprehensive directory listing implementations

The directory listing methods correctly:

  • Return version-appropriate metadata files (.zattrs/.zgroup for v2, zarr.json for v3)
  • Handle all directory levels (dataset, layer, magnification)
  • Support private link access with proper layer filtering
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/BinaryDataController.scala (1)

7-7: Consistent dataset ID refactoring across all endpoints

All controller methods have been properly refactored to:

  • Accept datasetId: ObjectId instead of organizationId/datasetDirectoryName pairs
  • Use datasetCache for data retrieval
  • Update access control to UserAccessRequest.readDataset(datasetId)

The refactoring maintains all existing functionality while simplifying the API.

Also applies to: 36-36, 55-273

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (2)

5-5: Excellent controller simplification through service delegation

The controller refactoring effectively:

  • Delegates all Zarr-specific logic to ZarrStreamingService
  • Removes duplicate code and complex helper methods
  • Maintains clean separation between access control and business logic
  • Preserves all existing functionality for both direct and private link access

Also applies to: 23-28, 44-126


301-315: Well-structured private link access handling

The ifIsAnnotationLayerOrElse helper method elegantly handles the distinction between annotation layers (served from tracingstore) and data layers (served from datastore), maintaining proper access control through token contexts.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (2)

90-114: Legacy pattern correctly implemented

The method properly uses the legacy pattern with organizationId and datasetDirectoryName parameters, which aligns with the controller's purpose of maintaining backward compatibility.


539-547: Verify the use of synchronous validation

This method uses validateAccessFromTokenContextForSyncBlock while all other methods use the async validateAccessFromTokenContext. Please verify this is intentional and document why synchronous validation is needed here.

frontend/javascripts/admin/rest_api.ts (2)

841-851: Dataset ID migration correctly implemented

The function properly migrates from separate organization/dataset parameters to a unified dataset ID approach, maintaining consistency with the backend changes.


1347-1357: Parameter type matches usage

The function correctly uses dataset.id from the dataset object parameter. The implementation is consistent with the dataset ID migration.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (10)

5-5: LGTM! Consistent refactoring to use dataset ID.

The changes properly implement the transition from DataSource IDs to dataset IDs, with appropriate access control updates and cache utilization.

Also applies to: 68-74


244-257: Transitional implementation looks good.

The method correctly bridges the new dataset ID approach with the existing mapping exploration logic. The comment acknowledges this is temporary until mappings support dataset IDs directly.


259-269: Efficient implementation using getWithLayer.

Good use of the cache method that retrieves both data source and layer atomically, reducing database calls.


350-364: Good handling of hybrid storage model.

The implementation correctly handles both disk-based and virtual datasets during the transition period. The comment properly documents this temporary dual-path approach.


366-378: Clean refactoring to support virtual datasets.

The method has been appropriately simplified to delegate dataset registration to the remote client, returning the newly created dataset ID.


409-427: Performance improvement by eliminating redundant disk reads.

Good refactoring that accepts the data source as a parameter instead of reloading it from disk, reducing unnecessary I/O operations.


429-439: Consistent use of cache infrastructure.

The reload method properly leverages the dataset cache for both invalidation and reloading, aligning with the virtual dataset architecture.


441-461: Well-structured deletion logic for hybrid storage.

The method correctly handles deletion for both disk-based and virtual datasets, with appropriate permission checks and cleanup operations.


463-471: Consistent refactoring across all segment and connectome methods.

All methods have been uniformly updated to use dataset IDs, proper cache access, and updated permissions. The refactoring maintains consistency throughout.

Also applies to: 473-484, 486-504, 506-517, 519-530, 532-540, 547-576, 583-609, 611-631, 633-651


685-690: Useful cache management endpoint.

Good addition of a dedicated cache invalidation endpoint with appropriate write permissions.

Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

🧹 Nitpick comments (9)
docs/data/concepts.md (1)

125-127: Fix markdown list formatting inconsistencies.

The static analysis tool flagged inconsistent list bullet styles. For better markdown compliance, consider using consistent dash (-) style bullets throughout:

- `dataLayers.mags`: Holds information about the available magnification steps of the layer.
-   * `dataLayers.mags.mag`: A 3-tuple (e.g., `[1, 1, 1]`, `[2, 2, 1]`) for uniform or non-uniform magnifications.
-   * `dataLayers.mags.path`: The path to the directory containing the data for this magnification step.
+ `dataLayers.mags`: Holds information about the available magnification steps of the layer.
+   - `dataLayers.mags.mag`: A 3-tuple (e.g., `[1, 1, 1]`, `[2, 2, 1]`) for uniform or non-uniform magnifications.
+   - `dataLayers.mags.path`: The path to the directory containing the data for this magnification step.
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (1)

11-11: Consider removing unused ObjectId import.

The ObjectId import appears to be unused in this file. If no code outside the shown changes references it, remove it to keep the imports clean.

#!/bin/bash
# Description: Check if ObjectId is used in this file
# Expected: If not used, the import can be removed

# Search for ObjectId usage in the file
rg "ObjectId" webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala
webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (1)

23-40: Good trait refactoring with a minor concern

The extension of DataLayerWithMagLocators improves the abstraction. However, the hardcoded defaultCubeSize = 1024 might be limiting for different use cases.

Consider making the cube size explicitly typed and documenting that subclasses may override it:

-  def defaultCubeSize = 1024
+  def defaultCubeSize: Int = 1024  // Can be overridden if needed
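For instance (illustrative trait and object names, not from the PR):

trait WKWLayerDefaults {
  def defaultCubeSize: Int = 1024 // shared default
}

// A hypothetical layer type that needs a different cube size:
object LargeCubeLayer extends WKWLayerDefaults {
  override def defaultCubeSize: Int = 2048
}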
app/models/dataset/ComposeService.scala (2)

40-53: Consider improving error messaging

The implementation is correct, but the hard-coded error string could be replaced with a translatable message key, consistent with other user-facing errors:

-      _ <- isComposable(composeRequest) ?~> "Datasets are not composable, they are not on the same data store"
+      _ <- isComposable(composeRequest) ?~> "dataset.compose.differentDataStores"
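A sketch of what isComposable presumably checks, with assumed types: every requested dataset must reference the same data store.

final case class Dataset(id: String, dataStoreName: String)

object ComposeChecks {
  def isComposable(datasets: Seq[Dataset]): Boolean =
    datasets.nonEmpty && datasets.map(_.dataStoreName).distinct.sizeIs == 1
}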

55-76: Consider more descriptive error message

The implementation correctly handles layer transformation and validation; only the fallback error message could be more specific:

-        case _ => Fox.failure("Unsupported layer type for composition: " + layer.getClass.getSimpleName)
+        case _ => Fox.failure(s"Unsupported layer type for composition: ${layer.getClass.getSimpleName}. Only DataLayerWithMagLocators is supported.")
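The match in question, sketched standalone (Either stands in for Fox; the layer types are simplified stand-ins):

sealed trait Layer { def name: String }
final case class DataLayerWithMagLocators(name: String, magPaths: List[String]) extends Layer
final case class OtherLayer(name: String) extends Layer

object LayerComposition {
  // Only layers carrying mag locators can be rewritten for composition.
  def prepare(layer: Layer): Either[String, DataLayerWithMagLocators] =
    layer match {
      case l: DataLayerWithMagLocators => Right(l) // transformations would be applied here
      case other =>
        Left(s"Unsupported layer type for composition: ${other.getClass.getSimpleName}. " +
          "Only DataLayerWithMagLocators is supported.")
    }
}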
webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/ZarrStreamingService.scala (2)

114-146: Remove unnecessary Fox.successful() call.

Line 121 starts the for-comprehension with an empty successful Fox whose only role is to let a plain assignment follow it. Since a for-comprehension cannot begin with an `=` enumerator, the call can be dropped by hoisting the assignment out of the comprehension (the method body then needs enclosing braces):

 def rawZarrCube(
     dataSource: DataSource,
     dataLayer: DataLayer,
     mag: String,
     coordinates: String
-)(implicit m: MessagesProvider, tc: TokenContext): Fox[Array[Byte]] =
+)(implicit m: MessagesProvider, tc: TokenContext): Fox[Array[Byte]] = {
+  val reorderedAdditionalAxes = dataLayer.additionalAxes.map(reorderAdditionalAxes)
   for {
-    _ <- Fox.successful(())
-    reorderedAdditionalAxes = dataLayer.additionalAxes.map(reorderAdditionalAxes)

170-207: Remove unnecessary Fox.successful() calls in directory methods.

Both methods start their for-comprehensions with an unnecessary Fox.successful(()). As above, a for-comprehension cannot begin with a plain assignment, so hoist the assignments out of the comprehensions (each method body then needs enclosing braces):

 def dataLayerDirectoryContents(
     dataSource: DataSource,
     dataLayer: DataLayer,
     zarrVersion: Int
-): Fox[List[String]] =
+): Fox[List[String]] = {
+  val mags = dataLayer.sortedMags
   for {
-    _ <- Fox.successful(())
-    mags = dataLayer.sortedMags

 def dataSourceDirectoryContents(
     dataSource: DataSource,
     zarrVersion: Int
-): Fox[List[String]] =
+): Fox[List[String]] = {
+  val layerNames = dataSource.dataLayers.map((dataLayer: DataLayer) => dataLayer.name)
   for {
-    _ <- Fox.successful(())
-    layerNames = dataSource.dataLayers.map((dataLayer: DataLayer) => dataLayer.name)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (1)

90-356: Legacy binary data methods correctly preserve old API behavior.

The methods maintain the original API signatures and use dataSourceRepository for backward compatibility. While there's code duplication with BinaryDataController, this is acceptable for a legacy compatibility layer.

Consider adding deprecation notices in the documentation to encourage migration to the new dataset ID-based endpoints.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (1)

366-378: Significant architectural change: dataset registration moved to remote service.

The add method now registers datasets remotely rather than creating them on disk, aligning with the virtual dataset architecture. This is a breaking change from the previous disk-based approach.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 3f49365 and 4bef4a8.

⛔ Files ignored due to path filters (1)
  • test/db/dataSets.csv is excluded by !**/*.csv
📒 Files selected for processing (59)
  • app/controllers/AnnotationIOController.scala (2 hunks)
  • app/controllers/DatasetController.scala (3 hunks)
  • app/controllers/UserTokenController.scala (1 hunks)
  • app/controllers/WKRemoteDataStoreController.scala (3 hunks)
  • app/controllers/WKRemoteTracingStoreController.scala (2 hunks)
  • app/models/annotation/AnnotationService.scala (4 hunks)
  • app/models/dataset/ComposeService.scala (1 hunks)
  • app/models/dataset/Dataset.scala (7 hunks)
  • app/models/dataset/DatasetService.scala (10 hunks)
  • app/models/dataset/WKRemoteDataStoreClient.scala (5 hunks)
  • app/models/dataset/explore/WKExploreRemoteLayerService.scala (1 hunks)
  • conf/evolutions/136-virtual-datasets.sql (1 hunks)
  • conf/evolutions/reversions/136-virtual-datasets.sql (1 hunks)
  • conf/webknossos.latest.routes (3 hunks)
  • conf/webknossos.versioned.routes (2 hunks)
  • docs/data/concepts.md (6 hunks)
  • frontend/javascripts/admin/api/mesh.ts (4 hunks)
  • frontend/javascripts/admin/dataset/composition_wizard/04_configure_new_dataset.tsx (1 hunks)
  • frontend/javascripts/admin/rest_api.ts (20 hunks)
  • frontend/javascripts/dashboard/dataset/dataset_settings_data_tab.tsx (0 hunks)
  • frontend/javascripts/dashboard/dataset/dataset_settings_delete_tab.tsx (1 hunks)
  • frontend/javascripts/dashboard/dataset/dataset_settings_view.tsx (2 hunks)
  • frontend/javascripts/dashboard/dataset/dataset_settings_viewconfig_tab.tsx (3 hunks)
  • frontend/javascripts/test/backend-snapshot-tests/datasets.e2e.ts (3 hunks)
  • frontend/javascripts/test/model/binary/layers/wkstore_adapter.spec.ts (5 hunks)
  • frontend/javascripts/types/schemas/datasource.schema.ts (2 hunks)
  • frontend/javascripts/types/schemas/datasource.types.ts (1 hunks)
  • frontend/javascripts/viewer/model/bucket_data_handling/wkstore_adapter.ts (1 hunks)
  • frontend/javascripts/viewer/model/sagas/load_histogram_data_saga.ts (1 hunks)
  • frontend/javascripts/viewer/model/sagas/meshes/precomputed_mesh_saga.ts (2 hunks)
  • frontend/javascripts/viewer/view/right-border-tabs/connectome_tab/connectome_view.tsx (2 hunks)
  • frontend/javascripts/viewer/view/right-border-tabs/segments_tab/segments_view_helper.tsx (1 hunks)
  • tools/postgres/schema.sql (2 hunks)
  • unreleased_changes/8708.md (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/DataStoreModule.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/BinaryDataController.scala (10 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DSMeshController.scala (6 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (18 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/WKDatasetController.scala (0 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (12 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (5 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/helpers/DatasetDeleter.scala (4 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala (2 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/AccessTokenService.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DSRemoteWebknossosClient.scala (4 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DataSourceService.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetCache.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/ZarrStreamingService.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/DSFullMeshService.scala (3 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/ComposeService.scala (0 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/UploadService.scala (2 hunks)
  • webknossos-datastore/conf/datastore.latest.routes (4 hunks)
  • webknossos-datastore/conf/datastore.versioned.routes (1 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (2 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (3 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (2 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/volume/VolumeTracingService.scala (1 hunks)
  • webknossos-tracingstore/conf/tracingstore.versioned.routes (1 hunks)
💤 Files with no reviewable changes (3)
  • frontend/javascripts/dashboard/dataset/dataset_settings_data_tab.tsx
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/ComposeService.scala
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/WKDatasetController.scala
🧰 Additional context used
🧠 Learnings (47)
frontend/javascripts/viewer/model/sagas/load_histogram_data_saga.ts (3)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: philippotto
PR: #8602
File: frontend/javascripts/oxalis/model/volumetracing/volume_annotation_sampling.ts:365-366
Timestamp: 2025-05-07T06:17:32.810Z
Learning: The parameter in applyVoxelMap was renamed from sliceCount to sliceOffset to better reflect its purpose, but this doesn't affect existing call sites since JavaScript/TypeScript function calls are position-based.

frontend/javascripts/viewer/view/right-border-tabs/segments_tab/segments_view_helper.tsx (4)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: philippotto
PR: #8602
File: frontend/javascripts/oxalis/model/volumetracing/volume_annotation_sampling.ts:365-366
Timestamp: 2025-05-07T06:17:32.810Z
Learning: The parameter in applyVoxelMap was renamed from sliceCount to sliceOffset to better reflect its purpose, but this doesn't affect existing call sites since JavaScript/TypeScript function calls are position-based.

Learnt from: MichaelBuessemeyer
PR: #8485
File: frontend/javascripts/oxalis/model/accessors/dataset_layer_transformation_accessor.ts:384-392
Timestamp: 2025-04-01T09:45:17.527Z
Learning: The function isRotationAndMirrorMaybeOnly in the dataset_layer_transformation_accessor.ts is intentionally designed to allow mirroring transformations (negative scale values). It uses the length comparison (scale.length() === NON_SCALED_VECTOR.length()) rather than component equality to permit mirrored axes while ensuring the overall scale magnitude remains the same.

frontend/javascripts/dashboard/dataset/dataset_settings_delete_tab.tsx (1)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

frontend/javascripts/test/model/binary/layers/wkstore_adapter.spec.ts (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
frontend/javascripts/viewer/model/bucket_data_handling/wkstore_adapter.ts (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
conf/webknossos.versioned.routes (2)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8202
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetErrorLoggingService.scala:68-72
Timestamp: 2024-11-25T10:02:03.702Z
Learning: In DatasetErrorLoggingService.scala, prefer using TextUtils.stackTraceAsString(exception) when logging exceptions instead of passing the exception directly to logger.error.

frontend/javascripts/viewer/model/sagas/meshes/precomputed_mesh_saga.ts (5)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: philippotto
PR: #8602
File: frontend/javascripts/oxalis/model/volumetracing/volume_annotation_sampling.ts:365-366
Timestamp: 2025-05-07T06:17:32.810Z
Learning: The parameter in applyVoxelMap was renamed from sliceCount to sliceOffset to better reflect its purpose, but this doesn't affect existing call sites since JavaScript/TypeScript function calls are position-based.

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/NeuroglancerMeshHelper.scala:143-152
Timestamp: 2025-04-30T08:54:11.252Z
Learning: In Neuroglancer's sharded format, mesh fragment data is located immediately before the manifest file in the same shard data file. The correct way to calculate the mesh data starting offset is to subtract the total mesh size from the manifest starting offset (neuroglancerOffsetStart - totalMeshSize), not by adding the manifest size.

frontend/javascripts/admin/dataset/composition_wizard/04_configure_new_dataset.tsx (3)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: philippotto
PR: #8602
File: frontend/javascripts/oxalis/model/volumetracing/volume_annotation_sampling.ts:365-366
Timestamp: 2025-05-07T06:17:32.810Z
Learning: The parameter in applyVoxelMap was renamed from sliceCount to sliceOffset to better reflect its purpose, but this doesn't affect existing call sites since JavaScript/TypeScript function calls are position-based.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

frontend/javascripts/test/backend-snapshot-tests/datasets.e2e.ts (1)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetCache.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/AccessTokenService.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
docs/data/concepts.md (8)

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: MichaelBuessemeyer
PR: #8221
File: CHANGELOG.unreleased.md:14-14
Timestamp: 2024-11-25T14:52:23.201Z
Learning: For the webknossos project, changelog entries in CHANGELOG.unreleased.md should be concise and avoid unnecessary detailed explanations.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/reducers/skeletontracing_reducer_helpers.ts:125-125
Timestamp: 2024-11-22T17:17:39.914Z
Learning: In the createNode function within skeletontracing_reducer_helpers.ts, the property resolution is intentionally kept even when the parameter is renamed to mag.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

Learnt from: frcroth
PR: #8598
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DatasetLayerAttachments.scala:89-95
Timestamp: 2025-06-02T09:49:51.047Z
Learning: In WebKnossos dataset layer attachments, multiple file types can safely use the same directory name (like "agglomerates") because the scanning logic filters by file extension. For example, AgglomerateFileInfo scans for .hdf5 files while CumsumFileInfo scans for .json files in the same "agglomerates" directory without interference.

frontend/javascripts/types/schemas/datasource.types.ts (1)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala (3)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

tools/postgres/schema.sql (1)

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

frontend/javascripts/viewer/view/right-border-tabs/connectome_tab/connectome_view.tsx (1)

Learnt from: philippotto
PR: #8542
File: frontend/javascripts/viewer/model/sagas/volumetracing_saga.tsx:0-0
Timestamp: 2025-05-30T12:36:09.930Z
Learning: The WebKnossos frontend follows Redux immutability principles where reducers act immutably and create new state objects when changes occur. This makes reference equality checks (like prevVolumeTracing.segments !== volumeTracing.segments) reliable and efficient for detecting actual state changes, allowing memoization to work correctly.

frontend/javascripts/dashboard/dataset/dataset_settings_viewconfig_tab.tsx (1)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

frontend/javascripts/dashboard/dataset/dataset_settings_view.tsx (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
app/controllers/WKRemoteTracingStoreController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
app/controllers/UserTokenController.scala (4)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/NeuroglancerPrecomputedMeshFileService.scala:161-166
Timestamp: 2025-04-28T14:18:04.368Z
Learning: In Scala for-comprehensions with the Fox error handling monad, Fox.fromBool() expressions should use the <- binding operator instead of the = assignment operator to properly propagate error conditions. Using = will cause validation failures to be silently ignored.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (1)

Learnt from: MichaelBuessemeyer
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/datareaders/precomputed/NeuroglancerPrecomputedShardingUtils.scala:23-28
Timestamp: 2025-04-25T11:06:13.275Z
Learning: AlfuCache in the WebKnossos codebase has default configured TTL (time-to-live) and size limits, providing automatic protection against unbounded memory growth.

app/controllers/AnnotationIOController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
frontend/javascripts/types/schemas/datasource.schema.ts (6)

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/reducers/skeletontracing_reducer_helpers.ts:125-125
Timestamp: 2024-11-22T17:17:39.914Z
Learning: In the createNode function within skeletontracing_reducer_helpers.ts, the property resolution is intentionally kept even when the parameter is renamed to mag.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/helpers/DatasetDeleter.scala (3)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

app/controllers/DatasetController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
app/models/dataset/WKRemoteDataStoreClient.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (4)

Learnt from: frcroth
PR: #8202
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetErrorLoggingService.scala:68-72
Timestamp: 2024-11-25T10:02:03.702Z
Learning: In DatasetErrorLoggingService.scala, prefer using TextUtils.stackTraceAsString(exception) when logging exceptions instead of passing the exception directly to logger.error.

Learnt from: MichaelBuessemeyer
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/datareaders/precomputed/NeuroglancerPrecomputedShardingUtils.scala:23-28
Timestamp: 2025-04-25T11:06:13.275Z
Learning: AlfuCache in the WebKnossos codebase has default configured TTL (time-to-live) and size limits, providing automatic protection against unbounded memory growth.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

app/models/annotation/AnnotationService.scala (2)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

conf/webknossos.latest.routes (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/conf/datastore.versioned.routes (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/volume/VolumeTracingService.scala (4)

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

app/models/dataset/Dataset.scala (2)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DSRemoteWebknossosClient.scala (2)

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DSMeshController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
frontend/javascripts/admin/api/mesh.ts (3)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

Learnt from: philippotto
PR: #8602
File: frontend/javascripts/oxalis/model/volumetracing/volume_annotation_sampling.ts:365-366
Timestamp: 2025-05-07T06:17:32.810Z
Learning: The parameter in applyVoxelMap was renamed from sliceCount to sliceOffset to better reflect its purpose, but this doesn't affect existing call sites since JavaScript/TypeScript function calls are position-based.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts:568-585
Timestamp: 2024-11-22T17:18:43.411Z
Learning: In the file frontend/javascripts/oxalis/geometries/materials/plane_material_factory.ts, the uniform name allResolutions should remain unchanged to maintain consistency with the shader code.

app/models/dataset/DatasetService.scala (5)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/NeuroglancerPrecomputedMeshFileService.scala:161-166
Timestamp: 2025-04-28T14:18:04.368Z
Learning: In Scala for-comprehensions with the Fox error handling monad, Fox.fromBool() expressions should use the <- binding operator instead of the = assignment operator to properly propagate error conditions. Using = will cause validation failures to be silently ignored.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/DSFullMeshService.scala (2)

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (3)

Learnt from: frcroth
PR: #8202
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetErrorLoggingService.scala:68-72
Timestamp: 2024-11-25T10:02:03.702Z
Learning: In DatasetErrorLoggingService.scala, prefer using TextUtils.stackTraceAsString(exception) when logging exceptions instead of passing the exception directly to logger.error.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

app/controllers/WKRemoteDataStoreController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/ZarrStreamingService.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (5)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/reducers/skeletontracing_reducer_helpers.ts:125-125
Timestamp: 2024-11-22T17:17:39.914Z
Learning: In the createNode function within skeletontracing_reducer_helpers.ts, the property resolution is intentionally kept even when the parameter is renamed to mag.

webknossos-datastore/conf/datastore.latest.routes (3)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8598
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DatasetLayerAttachments.scala:89-95
Timestamp: 2025-06-02T09:49:51.047Z
Learning: In WebKnossos dataset layer attachments, multiple file types can safely use the same directory name (like "agglomerates") because the scanning logic filters by file extension. For example, AgglomerateFileInfo scans for .hdf5 files while CumsumFileInfo scans for .json files in the same "agglomerates" directory without interference.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

app/models/dataset/ComposeService.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/BinaryDataController.scala (4)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.
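
Since this conversion shows up in many of the diffs below, here is a minimal sketch of the pattern; Fox and toFox are the real utilities listed in the code-graph section, while the lookup service, its behaviour, and the FoxImplicits import path are illustrative assumptions:

import scala.concurrent.ExecutionContext
import com.scalableminds.util.tools.{Fox, FoxImplicits}

// Hypothetical lookup service; extending FoxImplicits brings the Option-to-Fox
// lift into scope, so a None simply becomes an empty Fox that fails gracefully.
class UserLookupSketch(implicit ec: ExecutionContext) extends FoxImplicits {
  private def findUserIdOpt(name: String): Option[String] =
    if (name.nonEmpty) Some(s"id-of-$name") else None

  def userId(name: String): Fox[String] =
    for {
      id <- findUserIdOpt(name).toFox // empty Fox when the Option is None
    } yield id
}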

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.
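
A compact sketch of that short-circuiting; Fox.failure is quoted elsewhere in this review, Fox.successful is assumed to match it, and all three helpers are stand-ins:

import scala.concurrent.ExecutionContext
import com.scalableminds.util.tools.Fox

// Stand-in credit flow: if any step fails, the chain short-circuits and the
// later steps never run, so validation can safely precede the database write.
class CreditFlowSketch(implicit ec: ExecutionContext) {
  private def validateAmount(amount: BigDecimal): Fox[Unit] =
    if (amount > 0) Fox.successful(()) else Fox.failure("credits.amount.invalid")

  private def assertSufficientBalance(orgId: String, amount: BigDecimal): Fox[Unit] =
    Fox.successful(()) // stand-in for a balance lookup

  private def insertTransaction(orgId: String, delta: BigDecimal): Fox[Unit] =
    Fox.successful(()) // stand-in for the insert

  def chargeCredits(orgId: String, amount: BigDecimal): Fox[Unit] =
    for {
      _ <- validateAmount(amount)                 // fails -> whole Fox fails
      _ <- assertSufficientBalance(orgId, amount) // runs only after validation
      _ <- insertTransaction(orgId, -amount)      // runs only after both checks
    } yield ()
}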

Learnt from: MichaelBuessemeyer
PR: #8325
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/image/ImageCreator.scala:0-0
Timestamp: 2025-02-24T08:22:51.814Z
Learning: In ImageCreator.scala, precision loss during uint64 to double conversion is acceptable because the values are only used for image generation where each color component is ultimately represented as a single byte in ARGB format. For grayscale values (except uint24 which represents RGB), each color only uses 8 bits anyway.

frontend/javascripts/admin/rest_api.ts (1)

Learnt from: dieknolle3333
PR: #8229
File: frontend/javascripts/oxalis/model/accessors/dataset_accessor.ts:348-354
Timestamp: 2024-11-25T14:38:49.345Z
Learning: For the getDatasetExtentAsProduct function in dataset_accessor.ts, input validation for negative or zero dimensions is not necessary.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (3)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

🧬 Code Graph Analysis (13)
frontend/javascripts/viewer/model/sagas/load_histogram_data_saga.ts (2)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (1)
  • histogram (332-347)
frontend/javascripts/admin/rest_api.ts (1)
  • getHistogramForLayer (1424-1435)
frontend/javascripts/dashboard/dataset/dataset_settings_delete_tab.tsx (1)
frontend/javascripts/admin/rest_api.ts (1)
  • deleteDatasetOnDisk (1347-1357)
frontend/javascripts/admin/dataset/composition_wizard/04_configure_new_dataset.tsx (1)
frontend/javascripts/admin/rest_api.ts (1)
  • createDatasetComposition (1104-1122)
frontend/javascripts/test/backend-snapshot-tests/datasets.e2e.ts (2)
app/models/job/Job.scala (2)
  • datasetName (53-53)
  • datasetId (55-55)
frontend/javascripts/viewer/api/wk_dev.ts (1)
  • api (74-80)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetCache.scala (2)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (1)
  • invalidateCache (685-690)
app/models/job/Job.scala (1)
  • datasetId (55-55)
frontend/javascripts/viewer/view/right-border-tabs/connectome_tab/connectome_view.tsx (1)
frontend/javascripts/types/api_types.ts (1)
  • APIDataset (242-245)
frontend/javascripts/dashboard/dataset/dataset_settings_view.tsx (1)
frontend/javascripts/admin/rest_api.ts (1)
  • updateDatasetDatasource (984-995)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (2)
app/models/job/Job.scala (1)
  • datasetId (55-55)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/rpc/RPCRequest.scala (3)
  • addQueryString (28-31)
  • silent (64-67)
  • getWithJsonResponse (100-103)
app/models/dataset/WKRemoteDataStoreClient.scala (3)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/rpc/RPCRequest.scala (2)
  • addQueryString (28-31)
  • postJsonWithBytesResponse (152-156)
app/controllers/UserTokenController.scala (1)
  • RpcTokenHolder (30-38)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (1)
  • hasSegmentIndexFile (111-117)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (4)
util/src/main/scala/com/scalableminds/util/cache/AlfuCache.scala (3)
  • AlfuCache (15-42)
  • AlfuCache (44-78)
  • getOrLoad (17-27)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (2)
  • getDatasetIdForAnnotation (77-88)
  • getDataStoreUriForDataSource (71-77)
util/src/main/scala/com/scalableminds/util/tools/Fox.scala (3)
  • s (229-233)
  • s (233-243)
  • toFox (14-14)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/rpc/RPCRequest.scala (2)
  • withTokenFromContext (33-36)
  • getWithJsonResponse (100-103)
app/models/annotation/AnnotationService.scala (2)
app/models/dataset/WKRemoteDataStoreClient.scala (1)
  • hasSegmentIndexFile (80-90)
util/src/main/scala/com/scalableminds/util/objectid/ObjectId.scala (1)
  • toString (12-12)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/volume/VolumeTracingService.scala (2)
util/src/main/scala/com/scalableminds/util/tools/Fox.scala (3)
  • Fox (30-223)
  • Fox (225-289)
  • runOptional (159-169)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (3)
  • RemoteFallbackLayer (18-18)
  • RemoteFallbackLayer (20-25)
  • fromDataLayerAndDatasetId (21-24)
frontend/javascripts/admin/api/mesh.ts (3)
app/models/job/Job.scala (1)
  • datasetId (55-55)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala (2)
  • MeshChunkDataRequestList (41-44)
  • MeshChunkDataRequestList (50-52)
frontend/javascripts/admin/api/token.ts (1)
  • doWithToken (39-74)
🪛 LanguageTool
docs/data/concepts.md

[style] ~104-~104: For conciseness, consider replacing this expression with an adverb.
Context: ...ynonymously for mags throughout the UI. At the moment, WebKnossos guarantees correct renderin...

(AT_THE_MOMENT)

🪛 markdownlint-cli2 (0.17.2)
docs/data/concepts.md

125-125: Unordered list style
Expected: dash; Actual: plus

(MD004, ul-style)


126-126: Unordered list style
Expected: dash; Actual: asterisk

(MD004, ul-style)


127-127: Unordered list style
Expected: dash; Actual: asterisk

(MD004, ul-style)

🔇 Additional comments (140)
unreleased_changes/8708.md (1)

1-6: LGTM! Well-structured changelog entry.

The changelog appropriately documents the key changes from this PR: removal of directory creation for remote datasets and the addition of the virtual datasets database evolution. The reference to the specific evolution file helps users understand the database changes required.

tools/postgres/schema.sql (2)

24-24: LGTM! Correct schema version increment.

The schema version is properly incremented to 136, matching the evolution script referenced in the changelog.


110-110: LGTM! Appropriate column addition for virtual datasets.

The isVirtual boolean column with DEFAULT FALSE is correctly added to support the virtual datasets feature. The default value ensures existing datasets are properly handled as non-virtual.

frontend/javascripts/viewer/model/sagas/load_histogram_data_saga.ts (1)

40-40: LGTM! Correctly updated to use dataset ID.

The change from passing the entire dataset object to just dataset.id aligns with the API refactoring to use dataset IDs consistently. This matches the updated getHistogramForLayer function signature in the REST API.

frontend/javascripts/viewer/model/sagas/meshes/precomputed_mesh_saga.ts (2)

287-287: LGTM! Correctly updated to use dataset ID.

The change from dataset to dataset.id in the getMeshfileChunksForSegment call aligns with the API refactoring to use dataset IDs consistently throughout the system.


367-367: LGTM! Correctly updated to use dataset ID.

The change from dataset to dataset.id in the getMeshfileChunkData call is consistent with the dataset ID refactoring and matches the previous change in this file.

frontend/javascripts/dashboard/dataset/dataset_settings_delete_tab.tsx (1)

45-45: LGTM! Simplified API call with full dataset object.

The change to pass the entire dataset object directly to deleteDatasetOnDisk simplifies the code and aligns with the API's expected signature. This eliminates the need to construct a separate dataSourceId object and is consistent with the broader refactoring to use dataset objects/IDs throughout the system.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/DataStoreModule.scala (1)

67-67: LGTM: Service binding follows established patterns

The addition of ZarrStreamingService as an eager singleton is consistent with other service bindings in this module and appropriate for integrating the new Zarr streaming functionality.

webknossos-tracingstore/conf/tracingstore.versioned.routes (1)

3-3: LGTM: API versioning follows established pattern

The addition of /v10/ route mapping is consistent with the existing versioning scheme and properly routes to the same handler as other versions.

frontend/javascripts/viewer/view/right-border-tabs/segments_tab/segments_view_helper.tsx (1)

79-79: LGTM: Dataset identification updated to use dataset ID

The change from dataset.directoryName to dataset.id aligns with the broader refactoring to use unified dataset IDs instead of organization/directory name pairs. This simplifies dataset identification and is consistent with the virtual datasets feature.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetCache.scala (1)

29-29: LGTM: Type-safe parameter eliminates unnecessary conversion

Changing the parameter type from String to ObjectId improves type safety and eliminates the need for string-to-ObjectId conversion, making the API consistent with the cache key type and the calling controller's expectations.
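
Roughly what the changed signature implies, assuming the cache is an AlfuCache keyed by ObjectId (AlfuCache and getOrLoad per the code-graph section; the remote loader and the type parameter are stand-ins):

import scala.concurrent.ExecutionContext
import com.scalableminds.util.cache.AlfuCache
import com.scalableminds.util.objectid.ObjectId
import com.scalableminds.util.tools.Fox

// Keying by ObjectId instead of String: no stringly-typed keys and no
// parse step at the call sites.
class DatasetCacheSketch[DataSourceT](loadFromRemote: ObjectId => Fox[DataSourceT])(
    implicit ec: ExecutionContext) {
  private lazy val cache: AlfuCache[ObjectId, DataSourceT] = AlfuCache()

  def getById(datasetId: ObjectId): Fox[DataSourceT] =
    cache.getOrLoad(datasetId, loadFromRemote)
}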

frontend/javascripts/test/model/binary/layers/wkstore_adapter.spec.ts (3)

39-39: LGTM: Mock dataset updated with dataset ID

Adding the id field to the mock dataset aligns with the dataset ID-based refactoring and ensures tests use the new dataset structure.


162-167: LGTM: URL expectations updated for dataset ID routing

The expected URLs correctly use datasetId instead of the previous organization/datasetPath format, aligning with the new dataset ID-based API routing structure.


198-199: LGTM: Consistent URL format across all test cases

All test cases correctly use the new dataset ID format in expected URLs, ensuring comprehensive test coverage for the updated routing structure.

Also applies to: 218-219, 237-238

frontend/javascripts/viewer/view/right-border-tabs/connectome_tab/connectome_view.tsx (2)

15-15: LGTM: Import statement correctly updated

The removal of APIDataSourceId from imports aligns with the transition to dataset-centric identification.


353-358: Synapse API functions now accept APIDataset

All of the listed functions in frontend/javascripts/admin/rest_api.ts—including getSynapsesOfAgglomerates, getSynapseSourcesOrDestinations (and its getSynapseSources/getSynapseDestinations wrappers), getSynapsePositions, and getSynapseTypes—have been updated to take an APIDataset parameter instead of APIDataSourceId. No further changes are required here.

conf/webknossos.versioned.routes (2)

7-7: LGTM: Clear version documentation

The version log comment clearly documents the v10 change from organization/name-based dataset access to ID-based access.


18-18: LGTM: Correct route versioning

The v10 route mapping follows the established pattern and is correctly positioned before v9 for proper priority handling.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/volume/VolumeTracingService.scala (1)

977-979: LGTM: Correct transition to dataset ID-based fallback layer creation

The changes properly implement the transition from composite DataSource IDs to unified dataset IDs:

  1. The getDatasetIdForAnnotation call correctly fetches the dataset ID as a Fox[String]
  2. The fromDataLayerAndDatasetId factory method is the appropriate replacement for the previous fromDataLayerAndDataSource
  3. The Fox monadic flow properly handles the asynchronous dataset ID retrieval

The code maintains proper error handling and follows the established patterns in the codebase.
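
Reduced to its essentials, this is a two-step Fox comprehension; in the sketch below the client call is injected as a plain function and the layer construction is collapsed into a stand-in case class (the real code goes through RemoteFallbackLayer.fromDataLayerAndDatasetId and lifts its Box result):

import scala.concurrent.ExecutionContext
import com.scalableminds.util.tools.Fox

// Stand-in for RemoteFallbackLayer; only the two-step flow is the point.
case class FallbackLayerSketch(datasetId: String, layerName: String)

class VolumeTracingSketch(getDatasetIdForAnnotation: String => Fox[String])(
    implicit ec: ExecutionContext) {
  def remoteFallbackLayerFor(annotationId: String, layerName: String): Fox[FallbackLayerSketch] =
    for {
      datasetId <- getDatasetIdForAnnotation(annotationId) // Fox[String], an RPC in the real code
      fallback = FallbackLayerSketch(datasetId, layerName)
    } yield fallback
}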

frontend/javascripts/viewer/model/bucket_data_handling/wkstore_adapter.ts (2)

100-100: LGTM: Simplified dataset identification

The change to use state.dataset.id instead of separate organization and directory name components correctly aligns with the unified dataset ID approach.


105-105: LGTM: Updated URL construction for dataset ID-based API

The URL construction now correctly uses the dataset ID pattern /data/datasets/${datasetId}/layers/... which aligns with the backend API changes.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DataSourceService.scala (1)

272-272: LGTM: Method rename improves semantic clarity.

The rename from updateDataSource to updateDataSourceOnDisk better reflects the method's purpose and distinguishes it from potential remote update operations.

frontend/javascripts/types/schemas/datasource.types.ts (1)

17-19: LGTM: Type simplification reduces redundancy.

The intersection type approach eliminates duplicate property declarations since BaseRemoteLayer already provides boundingBox and mags (formerly wkwResolutions). This change aligns with the broader schema refactoring for WKW support.

conf/evolutions/reversions/136-virtual-datasets.sql (1)

5-7: Critical issue: View recreation will fail after column drop.

The reversion script drops the isVirtual column (line 6) and then recreates the view; if the recreated definition still references that column, the reversion will fail.

The view should be recreated with the exact pre-136 definition. Verify what the original view definition was before the isVirtual column was added and use that definition here.

#!/bin/bash
# Check the original view definition before virtual datasets were introduced
git log --follow -p -- conf/evolutions/reversions/136-virtual-datasets.sql | grep -A 10 -B 10 "CREATE VIEW webknossos.datasets_"
⛔ Skipped due to learnings
Learnt from: frcroth
PR: scalableminds/webknossos#8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (`dataset_layers`) and magnifications (`dataset_mags`). The `dataFormat` field is stored in the layers table while magnification-specific fields like `cubeLength` (specific to WKW format) are stored in the mags table.
conf/evolutions/136-virtual-datasets.sql (2)

6-6: LGTM: Column addition follows best practices.

The isVirtual column is properly defined with a NOT NULL constraint and a sensible default value (FALSE).


7-7: Review datasets_ view filter

I didn’t find any existing views filtering on isVirtual—only the evolution and schema files reference the new column. If the intent is to hide virtual datasets in this view, you’ll need to update both the forward and reversion scripts to include an AND NOT isVirtual clause. Otherwise, the current definition (filtering only on isDeleted) is intentional.

• conf/evolutions/136-virtual-datasets.sql:7
• conf/evolutions/reversions/136-virtual-datasets.sql:7

Suggested change:

- CREATE VIEW webknossos.datasets_ AS SELECT * FROM webknossos.datasets WHERE NOT isDeleted;
+ CREATE VIEW webknossos.datasets_ AS SELECT * FROM webknossos.datasets WHERE NOT isDeleted AND NOT isVirtual;

Please confirm the desired behavior.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/uploading/UploadService.scala (2)

444-444: LGTM: Explicit None parameter maintains existing behavior.

The explicit None arguments for the datasetId parameter correctly maintain the existing behavior while adapting to the updated deleteOnDisk method signature that supports the new dataset ID-based operations.

Also applies to: 452-452


499-499: LGTM: Method call updated to match service rename.

The call to updateDataSourceOnDisk correctly reflects the method rename in DataSourceService.scala, maintaining consistency across the codebase.

frontend/javascripts/admin/dataset/composition_wizard/04_configure_new_dataset.tsx (1)

171-178: LGTM! Function call simplified correctly.

The removal of the datastore URL parameter aligns with the backend refactoring that moved dataset composition logic from the datastore service to a dedicated ComposeService in the main application. The API now directly handles composition through the /api/datasets/compose endpoint without requiring the datastore URL.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/AccessTokenService.scala (1)

46-47: LGTM! ObjectId overloads added consistently.

The new overloaded methods for readDataset, deleteDataset, and writeDataset that accept ObjectId parameters are well-implemented. They properly convert the ObjectId to string before creating the UserAccessRequest, maintaining consistency with the existing pattern while supporting the broader refactoring to use dataset IDs throughout the system.

Also applies to: 49-50, 55-56
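
The overload pattern being described, as a sketch; the real UserAccessRequest carries different fields, so a simplified stand-in is used here (only the ObjectId import is an actual codebase type):

import com.scalableminds.util.objectid.ObjectId

// Sketch: the typed id is converted to a string exactly once, at the boundary,
// and the existing string-based constructor does the rest.
case class UserAccessRequestSketch(resourceId: String, mode: String)

object AccessRequestsSketch {
  def readDataset(datasetId: String): UserAccessRequestSketch =
    UserAccessRequestSketch(datasetId, "read")

  def readDataset(datasetId: ObjectId): UserAccessRequestSketch =
    readDataset(datasetId.toString)
}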

frontend/javascripts/test/backend-snapshot-tests/datasets.e2e.ts (2)

99-107: LGTM! Dynamic dataset ID retrieval implemented correctly.

The getTestDatasetId helper function is well-implemented with proper error handling. It dynamically retrieves the dataset ID by querying active datasets and matching by name, which makes tests more robust and aligns with the refactoring to use dataset IDs consistently throughout the system.


110-111: LGTM! Test cases updated to use dynamic dataset IDs.

The test cases now properly use the dynamic dataset ID resolution instead of hardcoded values. The template literals correctly incorporate the resolved dataset ID into the API URLs, ensuring tests work with the current dataset identifier scheme and remain robust against dataset changes.

Also applies to: 117-118, 126-128, 137-137, 148-150

app/controllers/UserTokenController.scala (1)

185-191: LGTM! Dataset deletion access control implemented consistently.

The new tryDelete method for datasets follows the established pattern from the data source deletion logic. It properly validates:

  1. Dataset deletion is enabled in configuration
  2. The dataset exists (using GlobalAccessContext)
  3. The user is an admin in the same organization as the dataset

The implementation correctly uses Fox error handling and integrates seamlessly with the existing access mode pattern matching.

Also applies to: 194-197
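
A condensed sketch of that three-step gate; the feature flag, dataset lookup, and admin check are injected stand-ins, and the message keys are invented:

import scala.concurrent.ExecutionContext
import com.scalableminds.util.objectid.ObjectId
import com.scalableminds.util.tools.Fox

class DeleteAccessSketch(deletionEnabled: Boolean,
                         findDatasetOrgId: ObjectId => Fox[String],
                         isAdminInOrg: String => Boolean)(implicit ec: ExecutionContext) {

  private def check(ok: Boolean, msg: String): Fox[Unit] =
    if (ok) Fox.successful(()) else Fox.failure(msg)

  def tryDelete(datasetId: ObjectId): Fox[Unit] =
    for {
      _ <- check(deletionEnabled, "dataset.delete.disabled") // 1. feature flag
      organizationId <- findDatasetOrgId(datasetId)          // 2. existence (GlobalAccessContext in the real code)
      _ <- check(isAdminInOrg(organizationId), "notAllowed") // 3. admin in the dataset's organization
    } yield ()
}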

app/models/dataset/explore/WKExploreRemoteLayerService.scala (1)

108-124: LGTM! Method refactored to use virtual dataset creation.

The method was appropriately renamed to addRemoteDatasourceToDatabase and refactored to directly call datasetService.createVirtualDataset instead of using remote client calls. This change:

  1. Simplifies the implementation by removing the need for remote datastore client interactions
  2. Aligns with the new virtual dataset functionality introduced in this PR
  3. Uses the correct parameters for virtual dataset creation (dataSource directory name, organization ID, dataStore, etc.)

The implementation correctly leverages the new virtual dataset architecture while maintaining the same functional outcome.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala (2)

493-508: LGTM! WKW layer support is properly implemented.

The mapped method correctly handles WKWDataLayer and WKWSegmentationLayer by following the same pattern as other layer types, applying the transformation functions to all necessary properties.


522-523: LGTM! Consistent implementation of getMags for WKW layers.

The method correctly returns the mags property for both WKW layer types, maintaining consistency with other layer implementations.

app/controllers/DatasetController.scala (3)

92-92: LGTM! Proper dependency injection of ComposeService.

The composeService parameter is correctly added to the constructor following the established dependency injection pattern.


494-500: LGTM! Well-implemented compose endpoint.

The compose method follows established controller patterns with proper security, JSON validation, and error handling. The response format is consistent with other endpoints.


149-152: LGTM! Method name change reflects new virtual dataset approach.

The change from addRemoteDatasource to addRemoteDatasourceToDatabase better describes the new functionality of creating virtual datasets in the database rather than registering remote datasources.

docs/data/concepts.md (6)

26-26: LGTM! Improved terminology for magnification steps.

The updated text clearly establishes "mags" as the primary term while maintaining reference to other synonymous terms, aligning with the codebase refactoring.


104-105: LGTM! Clear clarification of terminology usage.

The documentation now explicitly states that "magnifications" and "mags" are synonymous in the UI, and updates the rendering guarantee to use consistent "mag factors" terminology.


72-77: LGTM! Updated JSON metadata structure.

The mags array with mag and path properties is clearer than the previous wkwResolutions structure and aligns with the new schema.


89-93: LGTM! Consistent segmentation layer example.

The segmentation layer example correctly uses the new mags structure, maintaining consistency with the color layer example.


103-103: LGTM! Clear explanation of the new structure.

The documentation clearly explains that mag is always a 3-element array and describes the path property, helping users understand the format.


125-127: LGTM! Updated specification for new mags structure.

The specification correctly describes the new dataLayers.mags format with mag as a 3-tuple and path for data location, replacing the old wkwResolutions structure.

frontend/javascripts/dashboard/dataset/dataset_settings_view.tsx (2)

369-369: LGTM! Correctly updated API call to use dataset ID.

The parameter order and types correctly match the new updateDatasetDatasource signature that uses dataset.id instead of dataset.directoryName, aligning with the dataset-centric approach.


586-586: LGTM! Improved prop passing to child component.

Passing the full dataset object instead of just dataSourceId provides more context to DatasetSettingsViewConfigTab and aligns with the new dataset-centric API patterns.

frontend/javascripts/dashboard/dataset/dataset_settings_viewconfig_tab.tsx (6)

26-26: LGTM! Added necessary import for dataset type.

The APIDataset import is correctly added to support the new prop type.


37-40: LGTM! Updated props to use full dataset object.

Changing from dataSourceId to dataset: APIDataset | null | undefined provides more context and aligns with the parent component changes.


47-53: LGTM! Improved function signature and error handling.

The validateDefaultMappings function correctly accepts the full dataset object and includes proper error handling for undefined dataset cases.


58-79: LGTM! Well-implemented function updates with proper error handling.

The mapping requests correctly use the dataset object, include proper null checks, and maintain the existing caching mechanism. The error handling provides clear, descriptive messages.


81-98: LGTM! Validation logic correctly updated for new parameter structure.

The validation logic properly uses the dataset-based mapping requests while maintaining the same error checking functionality.


312-312: LGTM! Function call correctly updated with dataset parameter.

The validateDefaultMappings call correctly passes the dataset parameter instead of the old dataSourceId structure.

app/controllers/AnnotationIOController.scala (2)

340-342: LGTM! Method signature improvement aligns with dataset ID unification.

The updated signature using datasetId string instead of separate organization and dataset name parameters simplifies the API and aligns with the broader refactoring to use dataset IDs consistently.


365-365: LGTM! RPC call correctly updated to use new parameters.

The call to hasSegmentIndexFile correctly uses the updated parameters (datasetId and layerName) matching the new method signature.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (3)

53-53: LGTM! Cache updated correctly for dataset ID storage.

The cache rename and type change from DataSourceId to String properly reflects the shift to using dataset IDs instead of composite DataSource identifiers.


71-72: LGTM! Method simplified to use dataset ID parameter.

The updated signature using a single datasetId parameter simplifies the API and the RPC path construction is clean and direct.


77-86: LGTM! Method comprehensively updated for dataset ID retrieval.

The method rename, return type change to String, and updated RPC endpoint path are all consistent with the dataset ID unification effort. The cache integration works correctly with the updated cache type.

app/models/annotation/AnnotationService.scala (4)

124-124: LGTM! Method signature updated to include dataset ID parameter.

The addition of datasetId: ObjectId parameter is consistent with the dataset ID unification effort and provides the necessary data for updated RPC calls within the method.


142-142: LGTM! RPC call correctly updated to use dataset ID.

The call to hasSegmentIndexFile properly uses datasetId.toString() and layer.name, aligning with the updated method signature and the dataset ID unification approach.


241-241: LGTM! Call site correctly updated with dataset ID parameter.

The addition of dataset._id parameter to the createVolumeTracing call is correct and provides the required dataset ID information.


435-435: LGTM! Call site correctly updated with dataset ID parameter.

The call to createVolumeTracing correctly passes the datasetId parameter, maintaining consistency with the updated method signature.

frontend/javascripts/types/schemas/datasource.schema.ts (3)

52-52: LGTM! Property renamed for consistency with schema standards.

The rename from wkwResolutions to mags aligns with the naming convention used in other data layer types and improves overall schema consistency.


57-81: LGTM! Item properties enhanced with new functionality.

The changes improve the schema by:

  • Renaming resolution to mag for consistency
  • Adding path for data location specification
  • Adding credentials with proper user/password structure for authenticated access
  • Adding axisOrder for flexible axis configuration

These additions enhance the schema's capability to handle remote and authenticated data sources.


83-87: LGTM! Required fields updated correctly for schema consistency.

The required fields updates properly reflect the property changes:

  • Array items now require only mag instead of the removed resolution and cubeLength
  • Main object requirements updated from wkwResolutions to mags

This maintains schema validity while providing appropriate flexibility for optional properties.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/helpers/DatasetDeleter.scala (3)

25-31: LGTM - Clean addition for disk existence check.

The new existsOnDisk method provides a clean way to check for dataset presence on disk, with proper support for datasets in conversion. The implementation correctly handles both regular and conversion directory structures.
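
A sketch of what such a check can look like, assuming the conventional <organization>/<datasetDirectoryName> layout on disk; the name of the conversion staging directory is a guess:

import java.nio.file.{Files, Path}

// Check the regular location or, for datasets in conversion, the staging
// directory under the organization folder.
def existsOnDiskSketch(baseDir: Path,
                       organizationId: String,
                       datasetDirectoryName: String,
                       isInConversion: Boolean): Boolean = {
  val orgDir = baseDir.resolve(organizationId)
  val datasetDir =
    if (isInConversion) orgDir.resolve(".forConversion").resolve(datasetDirectoryName)
    else orgDir.resolve(datasetDirectoryName)
  Files.exists(datasetDir)
}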


35-35: Good design for virtual dataset support.

The addition of the optional datasetId parameter and conditional execution of moveSymlinks is well-designed for virtual datasets, which may not have physical symlinks to manage.

Also applies to: 76-76


85-89: fetchPaths signature verified

The DSRemoteWebknossosClient.fetchPaths method is defined as

def fetchPaths(datasetId: ObjectId): Fox[List[LayerMagLinkInfo]]

so passing the datasetId: ObjectId from moveSymlinks is correct. No further changes needed.

frontend/javascripts/admin/api/mesh.ts (2)

29-34: LGTM - Clean migration to dataset ID.

The function signature change from composite APIDataSourceId to single datasetId: string simplifies the API and aligns with the unified dataset identification approach. The URL construction correctly uses the dataset ID.

Also applies to: 59-59


79-84: Consistent API simplification.

The getMeshfileChunkData function follows the same pattern as getMeshfileChunksForSegment, maintaining consistency in the API redesign.

Also applies to: 87-87

app/controllers/WKRemoteTracingStoreController.scala (2)

124-133: Method rename aligns with unified dataset identification.

The rename from dataSourceIdForAnnotation to datasetIdForAnnotation and the simplified implementation returning dataset._id.toString correctly reflects the shift to dataset-centric operations.


149-158: Simplified parameter handling for dataset operations.

The method now directly accepts datasetId: ObjectId instead of deriving it from organization and directory name, which simplifies the logic and aligns with the unified approach.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (2)

39-39: Cache key simplification aligns with unified approach.

Changing the cache key from (String, String) to String for dataset ID-based caching is a good simplification that aligns with the unified dataset identification approach.


148-161: Consistent dataset ID usage in client methods.

The methods now consistently use datasetId directly instead of deriving it from annotation or composite identifiers, which simplifies the call sites and removes a derivation step.

app/models/dataset/Dataset.scala (4)

65-65: Well-integrated isVirtual field addition.

The new isVirtual field is properly integrated throughout the Dataset model - in the case class definition, parsing, and insertion queries. This supports the virtual datasets feature comprehensively.

Also applies to: 149-149, 625-625, 634-634


522-529: Useful utility method for virtual dataset management.

The getVirtualDatasetIds method provides a clean way to retrieve virtual dataset IDs, which will be useful for virtual dataset-specific operations.


693-693: Appropriate exclusion of virtual datasets from deactivation.

Adding NOT isVirtual to the deactivateUnreported query correctly excludes virtual datasets from deactivation, since they don't have physical presence on datastores that could become "unreported".
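
Illustratively, the change amounts to one extra predicate; everything around the NOT isVirtual condition in this sketch (the SET clause, the placeholders) is guessed rather than taken from the actual statement:

// Sketch only: virtual datasets have nothing on disk a datastore could report,
// so they must never be flagged as unreported.
val deactivateUnreportedSketch: String =
  """UPDATE webknossos.datasets
    |SET isUsable = FALSE, status = 'unreported'
    |WHERE _dataStore = ?
    |  AND NOT isVirtual
    |  AND _id NOT IN (/* ids the datastore just reported */)""".stripMargin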


777-780: Enhanced magnification record with path field.

The addition of the path field to the updateMags insertion query provides more complete magnification records, supporting the enhanced dataset management capabilities.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DSRemoteWebknossosClient.scala (5)

53-57: LGTM! Well-structured case class for dataset registration.

The DataSourceRegistrationInfo case class properly encapsulates the DataSource and optional folderId, supporting the new virtual dataset registration workflow. The JSON format is correctly defined.


144-156: LGTM! Consistent dataset registration implementation.

The registerDataSource method correctly:

  • Accepts a DataSource, DataSourceId, and optional folderId
  • Creates a DataSourceRegistrationInfo wrapper
  • Uses the organization and directory name from the DataSourceId for the API path
  • Returns the response body as the dataset ID

The implementation follows the established RPC pattern and handles authentication properly.


157-161: LGTM! Clean dataset update implementation.

The updateDataSource method correctly uses the ObjectId dataset identifier in the API path and follows the established RPC patterns for authentication and JSON serialization.


168-172: LGTM! Appropriate method for virtual dataset deletion.

The deleteVirtualDataset method correctly uses ObjectId for virtual dataset deletion, distinguishing it from the existing deleteDataSource method that handles traditional datasets with composite identifiers.


130-133: All fetchPaths calls correctly use ObjectId
Verified that the only invocation of remoteWebknossosClient.fetchPaths in DatasetDeleter.scala passes the ObjectId (datasetId) and there are no remaining DataSourceId usages. No further changes needed.

app/models/dataset/WKRemoteDataStoreClient.scala (6)

24-25: LGTM! Simplified cache key structure.

The cache key simplification from (organizationId, datasetName, layerName) to (datasetId, layerName) aligns with the unified dataset identification approach and reduces complexity.


37-37: LGTM! Consistent URL construction with dataset ID.

The URL construction correctly uses dataset._id for the dataset identifier, maintaining consistency with the new dataset ID-based routing.


60-60: LGTM! Updated URL for layer data access.

The URL construction properly uses dataset._id instead of organization and dataset directory components.


67-67: LGTM! Consistent dataset ID usage in data position endpoint.

The URL correctly uses dataset._id for the find data position endpoint.


80-89: LGTM! Well-refactored segment index check method.

The method signature correctly accepts datasetId: String and layerName: String parameters. The cache key usage (datasetId, layerName) and URL construction with k._1 (datasetId) and k._2 (layerName) are consistent and correct.


102-102: LGTM! Simplified cache invalidation with dataset ID.

The method correctly uses a single datasetId parameter for cache invalidation, aligning with the unified identification approach.

webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (3)

18-18: LGTM! Simplified case class with unified dataset identification.

The RemoteFallbackLayer case class correctly uses a single datasetId: String parameter instead of separate organization and dataset directory fields, aligning with the system-wide migration to unified dataset identification.


21-24: LGTM! Method renamed and updated appropriately.

The method fromDataLayerAndDatasetId is well-named and correctly constructs the RemoteFallbackLayer with the provided datasetId. The logic remains sound with proper error handling via Box.


37-38: LGTM! Updated to use dataset ID for annotation.

The method correctly calls getDatasetIdForAnnotation and uses the returned datasetId to construct the RemoteFallbackLayer, maintaining consistency with the dataset ID migration.

webknossos-datastore/conf/datastore.versioned.routes (4)

3-4: LGTM! Proper API versioning for dataset ID migration.

The addition of /v10/ routing to datastore.latest.Routes correctly supports the new dataset ID-based API while maintaining /v9/ for backward compatibility.


6-18: LGTM! Comprehensive legacy endpoint coverage.

The legacy routes properly cover all essential data access operations:

  • Image data reading (POST/GET variants)
  • Thumbnails
  • Data position finding
  • Histograms
  • Raw bucket access
  • Ad-hoc mesh generation
  • Mapping JSON retrieval

All routes correctly use organizationId and datasetDirectoryName parameters for backward compatibility.


20-33: LGTM! Complete Zarr v2 compatibility routes.

The Zarr v2 routes comprehensively cover:

  • Directory contents and metadata
  • Group information (.zgroup)
  • Data source properties
  • Layer-specific operations
  • Array metadata (.zarray)
  • Raw cube data access

All routes properly maintain the legacy parameter structure.


34-45: LGTM! Experimental Zarr v3 support.

The Zarr v3 experimental routes provide forward-looking compatibility with:

  • Zarr v3 metadata (zarr.json)
  • Updated directory and group structures
  • Proper versioning distinction from v2

The experimental nature is appropriately indicated in the path prefix.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/DSFullMeshService.scala (3)

55-62: LGTM! Streamlined method signature with direct object parameters.

The updated loadFor method correctly accepts DataSource and DataLayer objects directly, eliminating redundant repository lookups. This design improves performance and aligns with the controller layer changes that resolve datasets via cache before calling this method.


64-66: LGTM! Consistent parameter update for ad-hoc meshing.

The loadFullMeshFromAdHoc method signature correctly mirrors the loadFor method changes, accepting resolved DataSource and DataLayer objects directly.


114-117: LGTM! Consistent parameter update for mesh file loading.

The loadFullMeshFromMeshFile method signature is properly updated to accept DataSource and DataLayer objects directly, maintaining consistency with the other method updates in this service.

app/controllers/WKRemoteDataStoreController.scala (4)

245-256: LGTM!

The implementation correctly validates the dataset ID, ensures the dataset exists and is virtual before performing a soft delete.


283-310: Implementation looks good!

The method properly validates access, authenticates the user, checks organization membership, and creates the virtual dataset with appropriate parameters.


311-325: LGTM!

The method correctly validates the dataset existence and updates the datasource with proper abstraction layer conversion.


257-261: Good refactoring to use dataset ID!

The change from organization/directory name to dataset ID simplifies the API and aligns with the unified dataset identification approach.

app/models/dataset/DatasetService.scala (3)

103-124: Well-structured virtual dataset creation!

The method properly validates dataset name uniqueness, folder access permissions, and organization membership before creating the virtual dataset. The sequential validation using Fox monad ensures proper error propagation.
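
A sketch of that sequential validation; all types and helpers are stand-ins, and the point is that the insert only runs once every preceding check has succeeded:

import scala.concurrent.ExecutionContext
import com.scalableminds.util.objectid.ObjectId
import com.scalableminds.util.tools.Fox

case class UserSketch(_organization: String)
case class DatasetSketch(name: String, isVirtual: Boolean)

class DatasetServiceSketch(implicit ec: ExecutionContext) {
  private def check(ok: Boolean, msg: String): Fox[Unit] =
    if (ok) Fox.successful(()) else Fox.failure(msg)

  private def nameIsUnique(name: String, orgId: String): Fox[Boolean] = Fox.successful(true)
  private def folderWritable(folderId: Option[ObjectId], user: UserSketch): Fox[Boolean] = Fox.successful(true)
  private def insertRow(name: String, orgId: String): Fox[DatasetSketch] =
    Fox.successful(DatasetSketch(name, isVirtual = true))

  def createVirtualDataset(name: String,
                           orgId: String,
                           folderId: Option[ObjectId],
                           user: UserSketch): Fox[DatasetSketch] =
    for {
      unique <- nameIsUnique(name, orgId)
      _ <- check(unique, "dataset.name.alreadyTaken")
      writable <- folderWritable(folderId, user)
      _ <- check(writable, "folder.noWriteAccess")
      _ <- check(user._organization == orgId, "organization.mismatch")
      dataset <- insertRow(name, orgId) // writes the row with isVirtual = true
    } yield dataset
}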


290-296: Important fix for virtual dataset handling!

Good addition to exclude virtual datasets from deactivation. Since virtual datasets don't exist on disk, they shouldn't be marked as unreported when the datastore reports its datasets.


366-377: Proper WKW layer support implementation!

The code now correctly instantiates WKWDataLayer and WKWSegmentationLayer objects instead of throwing NotImplementedError, enabling full WKW format support.

Also applies to: 458-470

webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (1)

60-98: Excellent backward-compatible JSON serialization!

The custom Format implementation properly handles both new mags and legacy wkwResolutions fields, ensuring smooth migration without breaking existing data.
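
A minimal sketch of the read side of such a backward-compatible Format, using Play JSON with a heavily simplified layer type; note that the real code additionally converts the legacy wkwResolutions shape (resolution plus cubeLength) into mags, rather than reading both fields with the same Reads as done here for brevity:

import play.api.libs.json._

// Simplified stand-ins; the real WKWDataLayer carries many more fields.
case class MagSketch(mag: String)
object MagSketch { implicit val format: OFormat[MagSketch] = Json.format[MagSketch] }

case class WkwLayerSketch(name: String, mags: List[MagSketch])

object WkwLayerSketch {
  implicit val format: Format[WkwLayerSketch] = new Format[WkwLayerSketch] {
    def reads(json: JsValue): JsResult[WkwLayerSketch] =
      for {
        name <- (json \ "name").validate[String]
        // Prefer the new field; fall back to the legacy one for old on-disk JSON.
        mags <- (json \ "mags")
          .validate[List[MagSketch]]
          .orElse((json \ "wkwResolutions").validate[List[MagSketch]])
      } yield WkwLayerSketch(name, mags)

    // Writing always emits the new field, so data migrates on the next save.
    def writes(layer: WkwLayerSketch): JsValue =
      Json.obj("name" -> layer.name, "mags" -> layer.mags)
  }
}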

app/models/dataset/ComposeService.scala (1)

77-90: LGTM!

The composability check correctly ensures all datasets are on the same data store. The comment about future virtual dataset capabilities is helpful.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DSMeshController.scala (2)

23-24: Good migration to dataset cache!

Replacing dataSourceRepository with datasetCache improves performance and aligns with the unified dataset ID approach.


33-93: Excellent consistent refactoring!

All methods have been properly updated to use the unified dataset ID approach. The consistent use of datasetCache.getWithLayer and UserAccessRequest.readDataset simplifies the codebase.

conf/webknossos.latest.routes (5)

96-96: LGTM! Well-structured route for dataset composition.

The new POST route follows RESTful conventions and aligns with the PR's virtual dataset feature implementation.


112-112: Good refactor to unify dataset identification.

The change from composite (organizationId, directoryName) to single datasetId: ObjectId simplifies the API and improves type safety.


114-115: Smart approach for backward compatibility.

The dual identification approach (organizationId/directoryName for registration, datasetId for updates) provides a smooth migration path.


121-121: Clear separation of virtual dataset deletion logic.

Having a dedicated endpoint for virtual dataset deletion is appropriate given their different lifecycle management compared to disk-based datasets.


136-138: Consistent refactoring of tracingstore routes.

The route renaming and parameter changes to use datasetId: ObjectId maintain consistency with the overall dataset identification unification.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/ZarrStreamingService.scala (1)

22-31: Well-structured service with focused dependencies.

The service properly uses dependency injection and extends appropriate traits for Zarr output and Fox monadic operations.

webknossos-datastore/conf/datastore.latest.routes (3)

7-12: Consistent refactoring of binary data routes.

All binary data routes have been successfully unified to use datasetId: ObjectId, simplifying the API and improving type safety.


14-100: Comprehensive and consistent route parameter unification.

All data access routes have been systematically updated to use datasetId: ObjectId. This includes Knossos compatibility, Zarr streaming, segmentation mappings, agglomerates, meshes, connectomes, and segment indices.


111-123: Datasource management routes properly updated.

The routes have been correctly updated to use datasetId: ObjectId. The reload trigger appropriately maintains organizationId for compatibility while adding the new datasetId parameter.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/BinaryDataController.scala (3)

35-37: Good architectural improvement using datasetCache.

Replacing dataSourceRepository with datasetCache improves performance through caching while maintaining the same functionality.


55-74: Properly refactored to use dataset ID-based access.

The method correctly uses datasetCache.getById and updates access validation to readDataset(datasetId) while maintaining all error handling and logging functionality.


79-276: Excellent consistency in refactoring all controller methods.

All methods have been systematically updated with:

  • datasetId: ObjectId parameters
  • datasetCache.getById or getWithLayer for data retrieval
  • readDataset(datasetId) for access validation
  • Preserved error handling and response formatting

The refactoring maintains backward compatibility while modernizing the API.
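
The recurring endpoint shape those points describe, sketched with injected stand-ins for the access check and the cache lookup:

import scala.concurrent.ExecutionContext
import com.scalableminds.util.objectid.ObjectId
import com.scalableminds.util.tools.Fox

// Stand-ins for accessTokenService and datasetCache; only the call order matters.
class BinaryEndpointSketch(
    validateRead: ObjectId => Fox[Unit],
    getWithLayer: (ObjectId, String) => Fox[(String, String)])(implicit ec: ExecutionContext) {

  private def loadBytes(sourceAndLayer: (String, String)): Fox[Array[Byte]] =
    Fox.successful(Array.emptyByteArray) // stand-in for binaryDataService

  // Validate read access against the dataset id, resolve source and layer
  // from the cache, then serve the data.
  def handleRequest(datasetId: ObjectId, layerName: String): Fox[Array[Byte]] =
    for {
      _ <- validateRead(datasetId)                         // UserAccessRequest.readDataset(datasetId)
      sourceAndLayer <- getWithLayer(datasetId, layerName) // datasetCache.getWithLayer
      bytes <- loadBytes(sourceAndLayer)
    } yield bytes
}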

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (2)

63-89: Well-designed legacy controller for backward compatibility.

This controller provides a clean separation between the new dataset ID-based API and legacy endpoints, allowing for a smooth migration path for API clients.


357-567: Complete legacy support for Zarr and mesh endpoints.

The Zarr streaming and mesh endpoints maintain full backward compatibility with the original API signatures, ensuring existing integrations continue to work.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (5)

23-29: LGTM! Clean dependency injection and service delegation.

The constructor properly injects the new datasetCache and zarrStreamingService dependencies, following the established pattern of centralizing dataset access and delegating Zarr-specific logic to a dedicated service.


43-54: Good refactoring to unified dataset identification.

The method correctly uses the new dataset ID pattern and properly delegates Zarr header generation to the service layer.


118-128: Consistent refactoring with proper service delegation.

The method maintains the same functionality while adopting the unified dataset ID approach and delegating Zarr conversion to the service layer.


81-86: Private link methods properly updated for dataset ID usage.

The methods correctly use datasetCache.getWithLayer with the new dataset ID pattern while maintaining the dual support for annotation layers and regular data layers.

Also applies to: 103-110


317-335: Well-structured directory listing functionality.

The new directory listing methods provide a clean interface for browsing Zarr dataset structure. They consistently use the dataset ID pattern and properly delegate to the service layer for content generation.

Also applies to: 365-383, 416-432

frontend/javascripts/admin/rest_api.ts (6)

843-851: Consistent dataset ID usage in segment index check.

The function correctly uses the dataset ID in the URL construction, maintaining the same functionality with simplified parameters.


976-995: API methods properly updated for dataset ID usage.

The readDatasetDatasource and updateDatasetDatasource functions correctly use the dataset ID in URL construction and maintain backward compatibility by preserving the datasource structure.


1104-1122: Good handling of dataset composition with backward compatibility.

The function correctly maps the new dataset ID format while maintaining compatibility. The comment clearly explains the transformation for backend parsing.


1325-1357: Cache management functions properly simplified.

The cache clearing and deletion functions correctly use dataset IDs, removing the need for complex URL construction with organization and directory names.

Also applies to: 1365-1370


1389-1406: Layer operation APIs consistently updated.

All layer-related functions (findDataPositionForLayer, getHistogramForLayer, getMappingsForDatasetLayer, fetchMapping) properly use dataset IDs in their URL construction, maintaining a consistent API pattern.

Also applies to: 1424-1435, 1437-1461


1966-1982: Specialized data access APIs properly migrated.

All mesh, agglomerate, and connectome-related functions consistently use dataset IDs in their API calls, maintaining the same functionality with simplified parameters.

Also applies to: 1984-2017, 2082-2100, 2103-2213

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (5)

68-74: Clean refactoring of data source reading.

The method properly uses the new dataset ID pattern with centralized cache access.


350-364: Good handling of hybrid storage during transition.

The update method correctly handles both disk-based and remote datasets during the migration period. The comment clearly explains why this dual path is necessary.


409-427: Cache clearing properly adapted for new architecture.

The method correctly uses the provided DataSource object instead of reloading from disk, which aligns with the centralized cache approach.


441-461: Deletion logic properly handles hybrid storage.

The method correctly determines whether to delete from disk or remotely, maintaining compatibility during the transition to virtual datasets.


244-257: Listing methods consistently updated.

All data listing methods properly use the dataset ID pattern with centralized cache access and maintain consistent error handling.

Also applies to: 259-269, 463-471

Comment on lines +63 to +112
private def convertLayerToZarrLayer(layer: DataLayer, zarrVersion: Int): ZarrLayer = {
val dataFormat = if (zarrVersion == 2) DataFormat.zarr else DataFormat.zarr3
layer match {
case s: SegmentationLayer =>
val rank = s.additionalAxes.map(_.length).getOrElse(0) + 4
ZarrSegmentationLayer(
s.name,
s.boundingBox,
s.elementClass,
mags = s.sortedMags.map(
m =>
MagLocator(m,
Some(s"./${s.name}/${m.toMagLiteral(allowScalar = true)}"),
None,
Some(AxisOrder.cAdditionalxyz(rank)),
None,
None)),
mappings = s.mappings,
largestSegmentId = s.largestSegmentId,
numChannels = Some(if (s.elementClass == ElementClass.uint24) 3 else 1),
defaultViewConfiguration = s.defaultViewConfiguration,
adminViewConfiguration = s.adminViewConfiguration,
coordinateTransformations = s.coordinateTransformations,
additionalAxes = s.additionalAxes.map(reorderAdditionalAxes),
dataFormat = dataFormat
)
case d: DataLayer =>
val rank = d.additionalAxes.map(_.length).getOrElse(0) + 4
ZarrDataLayer(
d.name,
d.category,
d.boundingBox,
d.elementClass,
mags = d.sortedMags.map(
m =>
MagLocator(m,
Some(s"./${d.name}/${m.toMagLiteral(allowScalar = true)}"),
None,
Some(AxisOrder.cAdditionalxyz(rank)),
None,
None)),
numChannels = Some(if (d.elementClass == ElementClass.uint24) 3 else 1),
defaultViewConfiguration = d.defaultViewConfiguration,
adminViewConfiguration = d.adminViewConfiguration,
coordinateTransformations = d.coordinateTransformations,
additionalAxes = d.additionalAxes.map(reorderAdditionalAxes),
dataFormat = dataFormat
)
}
}

🛠️ Refactor suggestion

Refactor to eliminate code duplication.

The convertLayerToZarrLayer method contains significant duplication between the SegmentationLayer and DataLayer cases. Consider extracting common logic:

-private def convertLayerToZarrLayer(layer: DataLayer, zarrVersion: Int): ZarrLayer = {
+private def convertLayerToZarrLayer(layer: DataLayer, zarrVersion: Int): ZarrLayer = {
   val dataFormat = if (zarrVersion == 2) DataFormat.zarr else DataFormat.zarr3
+  val rank = layer.additionalAxes.map(_.length).getOrElse(0) + 4
+  val numChannels = Some(if (layer.elementClass == ElementClass.uint24) 3 else 1)
+  val mags = layer.sortedMags.map(m =>
+    MagLocator(m,
+               Some(s"./${layer.name}/${m.toMagLiteral(allowScalar = true)}"),
+               None,
+               Some(AxisOrder.cAdditionalxyz(rank)),
+               None,
+               None))
+  val reorderedAxes = layer.additionalAxes.map(reorderAdditionalAxes)
+  
   layer match {
     case s: SegmentationLayer =>
-      val rank = s.additionalAxes.map(_.length).getOrElse(0) + 4
       ZarrSegmentationLayer(
         s.name,
         s.boundingBox,
         s.elementClass,
-        mags = s.sortedMags.map(
-          m =>
-            MagLocator(m,
-                       Some(s"./${s.name}/${m.toMagLiteral(allowScalar = true)}"),
-                       None,
-                       Some(AxisOrder.cAdditionalxyz(rank)),
-                       None,
-                       None)),
+        mags = mags,
         mappings = s.mappings,
         largestSegmentId = s.largestSegmentId,
-        numChannels = Some(if (s.elementClass == ElementClass.uint24) 3 else 1),
+        numChannels = numChannels,
         defaultViewConfiguration = s.defaultViewConfiguration,
         adminViewConfiguration = s.adminViewConfiguration,
         coordinateTransformations = s.coordinateTransformations,
-        additionalAxes = s.additionalAxes.map(reorderAdditionalAxes),
+        additionalAxes = reorderedAxes,
         dataFormat = dataFormat
       )
     case d: DataLayer =>
-      val rank = d.additionalAxes.map(_.length).getOrElse(0) + 4
       ZarrDataLayer(
         d.name,
         d.category,
         d.boundingBox,
         d.elementClass,
-        mags = d.sortedMags.map(
-          m =>
-            MagLocator(m,
-                       Some(s"./${d.name}/${m.toMagLiteral(allowScalar = true)}"),
-                       None,
-                       Some(AxisOrder.cAdditionalxyz(rank)),
-                       None,
-                       None)),
-        numChannels = Some(if (d.elementClass == ElementClass.uint24) 3 else 1),
+        mags = mags,
+        numChannels = numChannels,
         defaultViewConfiguration = d.defaultViewConfiguration,
         adminViewConfiguration = d.adminViewConfiguration,
         coordinateTransformations = d.coordinateTransformations,
-        additionalAxes = d.additionalAxes.map(reorderAdditionalAxes),
+        additionalAxes = reorderedAxes,
         dataFormat = dataFormat
       )
   }
 }
📝 Committable suggestion

Suggested change:
private def convertLayerToZarrLayer(layer: DataLayer, zarrVersion: Int): ZarrLayer = {
val dataFormat = if (zarrVersion == 2) DataFormat.zarr else DataFormat.zarr3
val rank = layer.additionalAxes.map(_.length).getOrElse(0) + 4
val numChannels = Some(if (layer.elementClass == ElementClass.uint24) 3 else 1)
val mags = layer.sortedMags.map(m =>
MagLocator(
m,
Some(s"./${layer.name}/${m.toMagLiteral(allowScalar = true)}"),
None,
Some(AxisOrder.cAdditionalxyz(rank)),
None,
None
)
)
val reorderedAxes = layer.additionalAxes.map(reorderAdditionalAxes)
layer match {
case s: SegmentationLayer =>
ZarrSegmentationLayer(
s.name,
s.boundingBox,
s.elementClass,
mags = mags,
mappings = s.mappings,
largestSegmentId = s.largestSegmentId,
numChannels = numChannels,
defaultViewConfiguration = s.defaultViewConfiguration,
adminViewConfiguration = s.adminViewConfiguration,
coordinateTransformations = s.coordinateTransformations,
additionalAxes = reorderedAxes,
dataFormat = dataFormat
)
case d: DataLayer =>
ZarrDataLayer(
d.name,
d.category,
d.boundingBox,
d.elementClass,
mags = mags,
numChannels = numChannels,
defaultViewConfiguration = d.defaultViewConfiguration,
adminViewConfiguration = d.adminViewConfiguration,
coordinateTransformations = d.coordinateTransformations,
additionalAxes = reorderedAxes,
dataFormat = dataFormat
)
}
}
🤖 Prompt for AI Agents
In
webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/ZarrStreamingService.scala
between lines 63 and 112, the convertLayerToZarrLayer method duplicates code for
SegmentationLayer and DataLayer cases. Refactor by extracting the common logic
such as rank calculation, MagLocator creation, numChannels determination, and
common parameters into shared helper functions or variables. Then use these
shared components in both cases to reduce duplication and improve
maintainability.

@frcroth frcroth requested a review from fm3 July 21, 2025 13:50
@frcroth (Member, Author) commented Jul 21, 2025

@MichaelBuessemeyer Maybe you could look at the frontend changes? I'm not sure whether to assign you as a reviewer here; I'd also be happy if you simply changed anything you spot, since I don't know the frontend anyway.

@coderabbitai (bot) left a comment

Actionable comments posted: 0

♻️ Duplicate comments (1)
app/models/dataset/ComposeService.scala (1)

82-82: Good - empty layer validation implemented.

The validation for non-empty layers has been properly implemented, addressing the previous review comment.

🧹 Nitpick comments (1)
app/models/dataset/ComposeService.scala (1)

73-73: Improve error message for unsupported layer types.

The error message could be more informative by specifying which layer types are supported.

-        case _ => Fox.failure("Unsupported layer type for composition: " + layer.getClass.getSimpleName)
+        case _ => Fox.failure(s"Unsupported layer type for composition: ${layer.getClass.getSimpleName}. Only DataLayerWithMagLocators is supported.")
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4bef4a8 and f4cc156.

📒 Files selected for processing (3)
  • app/models/dataset/ComposeService.scala (1 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (2 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (3 hunks)
🚧 Files skipped from review as they are similar to previous changes (2)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala
🧰 Additional context used
🧠 Learnings (1)
app/models/dataset/ComposeService.scala (2)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/NeuroglancerPrecomputedMeshFileService.scala:161-166
Timestamp: 2025-04-28T14:18:04.368Z
Learning: In Scala for-comprehensions with the Fox error handling monad, Fox.fromBool() expressions should use the <- binding operator instead of the = assignment operator to properly propagate error conditions. Using = will cause validation failures to be silently ignored.

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: build-smoketest-push
  • GitHub Check: frontend-tests
  • GitHub Check: backend-tests
🔇 Additional comments (2)
app/models/dataset/ComposeService.scala (2)

40-53: LGTM!

The method properly orchestrates the dataset composition flow with appropriate error handling.


91-102: LGTM!

The datasource creation logic is clean and correct.

@MichaelBuessemeyer (Contributor) left a comment

Here are some TODO notes for me regarding the frontend changes

@@ -1106,21 +1102,20 @@ type DatasetCompositionArgs = {
};

export function createDatasetComposition(
  datastoreUrl: string,
  payload: DatasetCompositionArgs,
): Promise<NewDatasetReply> {
  // Formatting the dataSourceId to the old format so that the backend can parse it.
  // And removing the datasetId as the datastore cannot use it.
  const updatedLayers = payload.layers.map(({ dataSourceId, datasetId, ...rest }) => ({

TODO: Clean up; the frontend DatasetCompositionArgs no longer needs dataSourceId.

Comment on lines -342 to -344
if ("wkwResolutions" in layer) {
return layer.wkwResolutions.map((res) => res.resolution);
}

TODO: Check if the format for wkw datasets was adapted accordingly


TODO: Be careful due to incoming merge conflicts.

@fm3 (Member) left a comment

Great stuff, thanks for pushing this through!

I think it is already very close to where we want to go :-) Please have a look at my comments below.

I think one thing that we need to be careful about is the changed format written for WKW datasets (mags instead of wkwResolutions). We need to make sure that if we write datasource-properties.json files like that, it doesn’t break important workflows. Since we don’t keep the cubeLength, we cannot simply write out the old format. So this probably means we need to adapt downstream clients.

If I understand correctly, virtual and composed datasets still have a datasetDirectoryName. What is this value then? Is it used for something? Could it become optional and be None for those datasets? (Can be a follow-up, though)

Also, in a follow-up we might want to make the datastore property optional as well, so that any datastore can serve the remote data (e.g. one deployed in the region of the user). Do you think there are important spots still that rely on there being a specific datastore registered for virtual datasets?

Also, I wonder if we could get rid of the stateful DataSourceRepository entirely and have the checkInbox function report to wk directly. Everything that is available from it should also be readable using the DatasetCache, right?

@@ -0,0 +1,5 @@
### Changed
- A new directory is no longer created for new remote datasets.

Please also mention the new API version, so that the need to adapt API clients (especially in the libs) is pointed out.


Also, am I seeing it correctly that wkw datasets might now also be written with the mags key instead of wkwResolutions? That might also be a change worth mentioning here, and we do indeed need to make sure that the libs can read wkw datasets written like that.

Action.async(validateJson[String]) { implicit request =>
  dataStoreService.validateAccess(name, key) { _ =>
    for {
      datasetIdValidated <- ObjectId.fromString(request.body) ?~> "dataset.delete.invalidId" ~> BAD_REQUEST

I think you can use Action[ObjectId] and validateJson[ObjectId] directly
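
For illustration, a minimal sketch of that simplification (assuming ObjectId's implicit JSON Format is in scope, which the existing validateJson helper requires; the handler body is abbreviated and the service call is a hypothetical name):

def deleteDataset(name: String, key: String): Action[ObjectId] =
  Action.async(validateJson[ObjectId]) { implicit request =>
    dataStoreService.validateAccess(name, key) { _ =>
      for {
        // request.body is already a parsed ObjectId; malformed ids are
        // rejected by Play's JSON validation before this code runs.
        _ <- datasetService.deleteDataset(request.body) // hypothetical call
      } yield Ok
    }
  }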


Also, it might make sense to turn around the deletion protocol so that the frontend talks to the wk side, and wk checks if isVirtual==false and if so asks the datastore to also delete on disk. But that could be a follow-up issue. What do you think?

@frcroth (Member, Author) replied:

Yes that would make more sense
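
For concreteness, the turned-around protocol could look roughly like this on the wk side (a sketch only; deleteOnDisk and markAsDeleted are hypothetical names, not code from this PR):

// Frontend calls wk; wk decides whether the datastore must also delete on disk.
def deleteDataset(datasetId: ObjectId)(implicit ctx: DBAccessContext): Fox[Unit] =
  for {
    dataset <- datasetDAO.findOne(datasetId) ?~> "dataset.notFound"
    // Virtual datasets exist only in the database, so only non-virtual ones
    // have data on disk that the datastore needs to remove.
    _ <- Fox.runIf(!dataset.isVirtual)(dataStoreClient.deleteOnDisk(datasetId))
    _ <- datasetDAO.markAsDeleted(datasetId) // soft delete via the deleted flag
  } yield ()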

@@ -270,6 +280,49 @@ class WKRemoteDataStoreController @Inject()(

}

// Register a datasource from the datastore as a dataset in the database.
// This is called when adding remote virtual datasets (that should only exist in the database)
// by the data store after exploration.

Here, too, I wonder if the datastore needs to be involved in this communication at all 🤔

- organization <- organizationDAO.findOne(dataset._organization)
- } yield Ok(Json.toJson(DataSourceId(dataset.directoryName, organization._id)))
+ dataset <- datasetDAO.findOne(annotation._dataset) ?~> "dataset.notFound"
+ } yield Ok(Json.toJson(dataset._id.toString))

Suggested change
- } yield Ok(Json.toJson(dataset._id.toString))
+ } yield Ok(Json.toJson(dataset._id))

(I think an ObjectId can be passed to toJson directly)

- def hasSegmentIndexFile(organizationId: String, datasetName: String, layerName: String)(
-     implicit ec: ExecutionContext): Fox[Boolean] = {
-   val cacheKey = (organizationId, datasetName, layerName)
+ def hasSegmentIndexFile(datasetId: String, layerName: String)(implicit ec: ExecutionContext): Fox[Boolean] = {

Could we use ObjectIds here directly? (Also in the cache key)
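
A minimal sketch of the ObjectId-based variant (the cache field and the loader are made-up names for illustration; the AlfuCache API is the one from the surrounding code):

// Cache keyed on (ObjectId, String) instead of (String, String, String).
private lazy val segmentIndexFileCache: AlfuCache[(ObjectId, String), Boolean] = AlfuCache()

def hasSegmentIndexFile(datasetId: ObjectId, layerName: String)(implicit ec: ExecutionContext): Fox[Boolean] =
  segmentIndexFileCache.getOrLoad(
    (datasetId, layerName),
    { case (id, layer) => lookUpSegmentIndexFile(id, layer) } // hypothetical loader
  )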

userToken <- bearerTokenService.createAndInitDataStoreTokenForUser(user)
_ <- client.addDataSource(organization._id, datasetName, dataSource, folderId, userToken)
datasetId <- datasetService.createVirtualDataset(
dataSource.id.directoryName,

Conceptual question: What do we expect to be the directoryName for a remote dataset?

@frcroth (Member, Author) commented Jul 23, 2025

If everything were virtual, we would not need it. But for virtual datasets, we could use it as a URL-safe name (for Notion-style routes). That's what it is: derived from the name of the dataset.

Comment on lines 14 to 30
case class ComposeRequest(
    newDatasetName: String,
    targetFolderId: String,
    organizationId: String,
    voxelSize: VoxelSize,
    layers: Seq[ComposeRequestLayer]
)

object ComposeRequest {
  implicit val composeRequestFormat: OFormat[ComposeRequest] = Json.format[ComposeRequest]
}

case class ComposeRequestLayer(
    datasetId: String,
    sourceName: String,
    newName: String,
    transformations: Seq[CoordinateTransformation]
)

Could we make the various ids here ObjectIds?
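
Concretely, the suggestion would amount to something like this (a sketch; it assumes implicit JSON Formats for ObjectId and the other field types, as used elsewhere in the codebase):

case class ComposeRequestLayer(
    datasetId: ObjectId, // was String, previously parsed with ObjectId.fromString at the use site
    sourceName: String,
    newName: String,
    transformations: Seq[CoordinateTransformation]
)

object ComposeRequestLayer {
  implicit val composeRequestLayerFormat: OFormat[ComposeRequestLayer] = Json.format[ComposeRequestLayer]
}

case class ComposeRequest(
    newDatasetName: String,
    targetFolderId: ObjectId, // was String
    organizationId: String,   // organization ids are plain strings, so this one stays
    voxelSize: VoxelSize,
    layers: Seq[ComposeRequestLayer]
)

object ComposeRequest {
  implicit val composeRequestFormat: OFormat[ComposeRequest] = Json.format[ComposeRequest]
}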

def composeDataset(composeRequest: ComposeRequest, user: User)(
    implicit ctx: DBAccessContext): Fox[(DataSource, ObjectId)] =
  for {
    _ <- isComposable(composeRequest) ?~> "Datasets are not composable, they are not on the same data store"

The result boolean isn't used here. I think we need something like Fox.assertTrue(isComposable(composeRequest)). Alternatively, it could be assertComposable: Fox[Unit], which returns Fox.failure if dataStores.distinct.size isn't 1.
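
The assertComposable variant could be sketched like this (a hypothetical helper, assuming the ObjectId-typed ComposeRequestLayer sketched above; the Dataset field name _dataStore is an assumption based on the surrounding code):

private def assertComposable(composeRequest: ComposeRequest)(implicit ctx: DBAccessContext): Fox[Unit] =
  for {
    datasets <- Fox.serialCombined(composeRequest.layers.map(_.datasetId).toList)(datasetDAO.findOne)
    dataStoreNames = datasets.map(_._dataStore).distinct
    // Binding Fox.fromBool with <- short-circuits the for-comprehension on failure.
    _ <- Fox.fromBool(dataStoreNames.size == 1) ?~> "Datasets are not composable, they are not on the same data store"
  } yield ()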

for {
  datasetIdValidated <- ObjectId.fromString(composeLayer.datasetId) ?~> "Invalid dataset ID"
  dataset <- datasetDAO.findOne(datasetIdValidated) ?~> "Dataset not found"
  ds <- datasetService.fullDataSourceFor(dataset)

I’d prefer to write out dataSource, because ds also fits dataset, which could lead to ambiguity.
This was not introduced here but I think it would be a good change now that we are here.

datasetIdValidated <- ObjectId.fromString(composeLayer.datasetId) ?~> "Invalid dataset ID"
dataset <- datasetDAO.findOne(datasetIdValidated) ?~> "Dataset not found"
ds <- datasetService.fullDataSourceFor(dataset)
ds <- ds.toUsable.toFox ?~> "Dataset not usable"

And here maybe usableDataSource, to avoid shadowing the variable.

@fm3 mentioned this pull request Jul 22, 2025
@frcroth (Member, Author) commented Jul 23, 2025

@fm3 I think I handled all comments except #8708 (comment). If you want to take a look already, do so, otherwise I will continue working on it on Monday.

@coderabbitai (bot) left a comment

Actionable comments posted: 1

♻️ Duplicate comments (3)
unreleased_changes/8708.md (1)

4-8: Include API version and complete migration details.

Based on the changes across the codebase, the changelog is missing important details that were previously requested:

  1. The new API version 10 is not mentioned
  2. The change from wkwResolutions to mags for WKW datasets is not documented

Apply this diff to provide complete migration information:

 ### Breaking Changes
 - Datasets are now referenced by their ID instead of the combination of the organization and the directory name. 
 - Newly uploaded datasets do not use the `wkwResolutions` property anymore, but instead use `mags` like imported remote datasets.
-- Interacting with newly created datasets requires libraries to support API version of 10 or higher.
+- Interacting with newly created datasets requires libraries to support API version 10 or higher.
+- WKW datasets may now be written with the `mags` key instead of `wkwResolutions`. Client libraries must be updated to handle both formats when reading WKW datasets.
app/models/dataset/ComposeService.scala (1)

55-74: Improve variable naming for clarity.

While the implementation is solid, the variable naming could be clearer to avoid ambiguity:

Apply this diff to improve readability:

 private def getLayerFromComposeLayer(composeLayer: ComposeRequestLayer)(
     implicit ctx: DBAccessContext): Fox[DataLayer] =
   for {
     dataset <- datasetDAO.findOne(composeLayer.datasetId) ?~> "Dataset not found"
-    dataSource <- datasetService.fullDataSourceFor(dataset)
-    usableDataSource <- dataSource.toUsable.toFox ?~> "Dataset not usable"
+    fullDataSource <- datasetService.fullDataSourceFor(dataset)
+    usableDataSource <- fullDataSource.toUsable.toFox ?~> "Dataset not usable"
     layer <- usableDataSource.dataLayers.find(_.name == composeLayer.sourceName).toFox
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (1)

63-79: Consider the architectural approach for legacy API compatibility.

This controller duplicates significant functionality from other controllers to maintain v9 API compatibility. While functional, this approach creates substantial code duplication and maintenance burden.

The past review comment from fm3 suggests exploring a route delegation approach where the datastore could call WK to get a dataset ID from directoryName+organizationName, then delegate to new implementations. This could significantly reduce code duplication.

Consider implementing this suggested delegation pattern in a follow-up to reduce maintenance overhead.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 67278af and a415ecb.

📒 Files selected for processing (24)
  • app/controllers/AnnotationIOController.scala (2 hunks)
  • app/controllers/WKRemoteDataStoreController.scala (3 hunks)
  • app/controllers/WKRemoteTracingStoreController.scala (2 hunks)
  • app/models/annotation/AnnotationService.scala (4 hunks)
  • app/models/dataset/ComposeService.scala (1 hunks)
  • app/models/dataset/Dataset.scala (6 hunks)
  • app/models/dataset/DatasetService.scala (12 hunks)
  • app/models/dataset/WKRemoteDataStoreClient.scala (5 hunks)
  • app/models/dataset/explore/WKExploreRemoteLayerService.scala (1 hunks)
  • conf/evolutions/137-virtual-datasets.sql (1 hunks)
  • conf/evolutions/reversions/137-virtual-datasets.sql (1 hunks)
  • tools/postgres/schema.sql (2 hunks)
  • unreleased_changes/8708.md (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/BinaryDataController.scala (10 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (18 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (1 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (12 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (4 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/helpers/DatasetDeleter.scala (4 hunks)
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala (3 hunks)
  • webknossos-datastore/conf/datastore.versioned.routes (1 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (2 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (3 hunks)
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala (2 hunks)
🚧 Files skipped from review as they are similar to previous changes (16)
  • tools/postgres/schema.sql
  • conf/evolutions/reversions/137-virtual-datasets.sql
  • app/controllers/AnnotationIOController.scala
  • app/models/dataset/explore/WKExploreRemoteLayerService.scala
  • app/controllers/WKRemoteTracingStoreController.scala
  • app/models/annotation/AnnotationService.scala
  • app/models/dataset/Dataset.scala
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/models/datasource/DataLayer.scala
  • webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/tracings/RemoteFallbackLayer.scala
  • app/controllers/WKRemoteDataStoreController.scala
  • conf/evolutions/137-virtual-datasets.sql
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/helpers/DatasetDeleter.scala
  • app/models/dataset/WKRemoteDataStoreClient.scala
  • app/models/dataset/DatasetService.scala
  • webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/BinaryDataController.scala
🧰 Additional context used
🧠 Learnings (8)
📓 Common learnings
Learnt from: frcroth
PR: scalableminds/webknossos#8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the `updateMags` method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:
1. Non-WKW datasets have `magsOpt` populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
2. WKW datasets will have `wkwResolutionsOpt` populated and use the second branch which includes cubeLength.
3. The final branch is a fallback for legacy data.
This ensures appropriate fields are populated for each dataset type.
unreleased_changes/8708.md (4)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

webknossos-datastore/conf/datastore.versioned.routes (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (5)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8202
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/DatasetErrorLoggingService.scala:68-72
Timestamp: 2024-11-25T10:02:03.702Z
Learning: In DatasetErrorLoggingService.scala, prefer using TextUtils.stackTraceAsString(exception) when logging exceptions instead of passing the exception directly to logger.error.

Learnt from: MichaelBuessemeyer
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/datareaders/precomputed/NeuroglancerPrecomputedShardingUtils.scala:23-28
Timestamp: 2025-04-25T11:06:13.275Z
Learning: AlfuCache in the WebKnossos codebase has default configured TTL (time-to-live) and size limits, providing automatic protection against unbounded memory growth.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

app/models/dataset/ComposeService.scala (4)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/NeuroglancerPrecomputedMeshFileService.scala:161-166
Timestamp: 2025-04-28T14:18:04.368Z
Learning: In Scala for-comprehensions with the Fox error handling monad, Fox.fromBool() expressions should use the <- binding operator instead of the = assignment operator to properly propagate error conditions. Using = will cause validation failures to be silently ignored.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (5)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/volumetracing_saga.tsx:433-434
Timestamp: 2024-11-22T17:19:07.947Z
Learning: In the codebase, certain usages of segmentationLayer.resolutions are intentionally retained and should not be changed to segmentationLayer.mags during refactoring.

Learnt from: frcroth
PR: #8609
File: conf/evolutions/133-datasource-properties-in-db.sql:8-16
Timestamp: 2025-05-12T14:15:05.259Z
Learning: The database schema in WEBKNOSSOS has separate tables for dataset layers (dataset_layers) and magnifications (dataset_mags). The dataFormat field is stored in the layers table while magnification-specific fields like cubeLength (specific to WKW format) are stored in the mags table.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/sagas/proofread_saga.ts:1039-1039
Timestamp: 2024-11-22T17:18:04.217Z
Learning: In frontend/javascripts/oxalis/model/sagas/proofread_saga.ts, when calling getMagInfo, the use of volumeTracingLayer.resolutions is intentional and should not be changed to volumeTracingLayer.mags.

Learnt from: dieknolle3333
PR: #8168
File: frontend/javascripts/oxalis/model/reducers/skeletontracing_reducer_helpers.ts:125-125
Timestamp: 2024-11-22T17:17:39.914Z
Learning: In the createNode function within skeletontracing_reducer_helpers.ts, the property resolution is intentionally kept even when the parameter is renamed to mag.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (1)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (3)

Learnt from: frcroth
PR: #8609
File: app/models/dataset/Dataset.scala:753-775
Timestamp: 2025-05-12T13:07:29.637Z
Learning: In the updateMags method of DatasetMagsDAO (Scala), the code handles different dataset types distinctly:

  1. Non-WKW datasets have magsOpt populated and use the first branch which includes axisOrder, channelIndex, and credentialId.
  2. WKW datasets will have wkwResolutionsOpt populated and use the second branch which includes cubeLength.
  3. The final branch is a fallback for legacy data.
    This ensures appropriate fields are populated for each dataset type.

Learnt from: MichaelBuessemeyer
PR: #8352
File: app/models/organization/CreditTransactionService.scala:0-0
Timestamp: 2025-01-27T12:06:42.865Z
Learning: In Scala's for-comprehension with Fox (Future-like type), the <- operator ensures sequential execution. If any step fails, the entire chain short-circuits and returns early, preventing subsequent operations from executing. This makes it safe to perform validation checks before database operations.

Learnt from: frcroth
PR: #8236
File: webknossos-datastore/app/com/scalableminds/webknossos/datastore/services/mesh/MeshFileService.scala:170-173
Timestamp: 2025-04-23T08:51:57.756Z
Learning: In the webknossos codebase, classes extending FoxImplicits have access to an implicit conversion from Option[A] to Fox[A], where None is converted to an empty Fox that fails gracefully in for-comprehensions.

🧬 Code Graph Analysis (1)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (4)
util/src/main/scala/com/scalableminds/util/cache/AlfuCache.scala (3)
  • AlfuCache (15-42)
  • AlfuCache (44-78)
  • getOrLoad (17-27)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteWebknossosClient.scala (2)
  • getDatasetIdForAnnotation (77-88)
  • getDataStoreUriForDataset (71-77)
util/src/main/scala/com/scalableminds/util/tools/Fox.scala (3)
  • s (229-233)
  • s (233-243)
  • toFox (14-14)
webknossos-datastore/app/com/scalableminds/webknossos/datastore/rpc/RPCRequest.scala (2)
  • withTokenFromContext (33-36)
  • getWithJsonResponse (100-103)
🔇 Additional comments (17)
webknossos-tracingstore/app/com/scalableminds/webknossos/tracingstore/TSRemoteDatastoreClient.scala (1)

39-161: Clean refactoring to dataset ID-based identification.

The migration from (organizationId, datasetDirectoryName) tuples to ObjectId for dataset identification is well-executed. The cache keys and method signatures are consistently updated throughout the class.

webknossos-datastore/conf/datastore.versioned.routes (1)

3-214: Excellent implementation of versioned routes.

The routing configuration properly addresses all previous feedback:

  • New v10 routes for the latest API version
  • Explicit route declarations for v5-v9 instead of forwarding
  • Consistent use of LegacyController methods with V9 suffix
  • Comprehensive coverage of all endpoint types including Zarr2 and Zarr3
webknossos-datastore/app/com/scalableminds/webknossos/datastore/dataformats/layers/WKWDataLayers.scala (1)

26-152: Well-implemented migration to unified magnification handling.

The refactoring from wkwResolutions to mags is excellently handled:

  • Backward compatibility is maintained through custom JSON formats that attempt to read mags first, then fall back to wkwResolutions (see the sketch after this list)
  • The fixed lengthOfUnderlyingCubes value (1024) is appropriately retained as noted in previous discussions
  • Clear error messages when neither field is present
  • Clean trait hierarchy with DataLayerWithMagLocators
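
To make the fallback concrete, a rough sketch of such a Reads (not the literal code from WKWDataLayers.scala; readWithMags and toMagLocator are illustrative stand-ins):

// Prefer the new `mags` key, fall back to the legacy `wkwResolutions` key.
implicit val wkwDataLayerReads: Reads[WKWDataLayer] = Reads { json =>
  (json \ "mags").validate[List[MagLocator]] match {
    case JsSuccess(mags, _) => readWithMags(json, mags) // new format
    case _ =>
      (json \ "wkwResolutions").validate[List[WKWResolution]] match {
        case JsSuccess(resolutions, _) =>
          readWithMags(json, resolutions.map(_.toMagLocator)) // legacy format
        case _ => JsError("Either 'mags' or 'wkwResolutions' must be present")
      }
  }
}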
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/LegacyController.scala (3)

95-96: Access control validation is consistent with legacy patterns.

The access control correctly uses DataSourceId(datasetDirectoryName, organizationId) for legacy compatibility, which aligns with the v9 API expectations.


100-103: Data retrieval pattern is consistent throughout the controller.

The pattern of using dataSourceRepository.getDataSourceAndDataLayer() with organization and directory parameters is consistently applied across all legacy endpoints, maintaining backward compatibility.


374-374: Service delegation reduces code duplication for Zarr operations.

Good use of zarrStreamingService.getHeader() to delegate Zarr-specific logic rather than duplicating it in this controller. This pattern is consistently applied throughout the Zarr routes.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/ZarrStreamingController.scala (5)

44-52: Dataset ID transition is correctly implemented.

The method signature correctly uses datasetId: ObjectId instead of separate organization and directory parameters. Access control properly uses UserAccessRequest.readDataset(datasetId) and data retrieval uses the unified datasetCache.getWithLayer() pattern.


79-85: Private link annotation handling updated correctly.

The annotation private link functionality correctly adapts to use annotationSource.datasetId with the new cache pattern while maintaining the same error handling approach.


208-212: Service delegation pattern is well implemented.

The refactor properly delegates Zarr cube operations to zarrStreamingService.rawZarrCube() while maintaining proper access control and error handling.


328-328: Display path updated for dataset ID consistency.

The display path correctly uses datasetId instead of the old organization/directory format, maintaining consistency with the new identification scheme.


186-186: Access control is enforced for annotation dataset access

The datasetCache.getById(annotationSource.datasetId) call lives inside the same
accessTokenService.validateAccessFromTokenContext(UserAccessRequest.readDataset(datasetId))
block as the non-annotation endpoint. Since all datasetCache.getById invocations here are wrapped by that validateAccessFromTokenContext guard, there is no bypass of access controls in the annotation flow.

webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala (6)

69-75: Method signature and access control correctly updated.

The transition from separate organization/directory parameters to datasetId: ObjectId is properly implemented with correct access control using UserAccessRequest.readDataset(datasetId).


251-257: Mapping operations retain DataSourceId dependency.

The code correctly notes that mappings are still accessed by DataSourceId rather than datasetId. This is a reasonable intermediate state during the transition but should be tracked for future migration.


351-365: Update method properly handles disk vs remote datasets.

The logic correctly distinguishes between datasets that exist on disk vs virtual datasets, updating appropriately via disk operations or remote client calls. The data source ID preservation ensures consistency.


369-379: Add method correctly implements dataset registration.

The method properly constructs a DataSourceId for compatibility and delegates to the remote client for dataset registration. The response correctly returns the new dataset ID.


441-461: Delete operation handles both disk and virtual datasets.

The method correctly handles both scenarios: deleting datasets that exist on disk and virtual datasets. The conditional logic properly removes from both disk and database for disk-based datasets, or delegates to remote client for virtual datasets.


430-439: Reload method uses proper cache invalidation pattern.

The reload method correctly clears caches, then calls refreshDataSource to get the updated version. This ensures proper cache consistency.

Comment on lines +692 to +711
private def refreshDataSource(datasetId: ObjectId)(implicit tc: TokenContext): Fox[DataSource] =
  for {
    dataSourceInDB <- datasetCache.getById(datasetId) ~> NOT_FOUND
    dataSourceId = dataSourceInDB.id
    dataSourceFromDir <- Fox.runIf(
      dataSourceService.existsOnDisk(dataSourceId.organizationId, dataSourceId.directoryName)) {
      dataSourceService
        .dataSourceFromDir(
          dataSourceService.dataBaseDir.resolve(dataSourceId.organizationId).resolve(dataSourceId.directoryName),
          dataSourceId.organizationId)
        .toUsable
        .toFox
    }
    _ <- dataSourceFromDir match {
      case Some(ds) => dsRemoteWebknossosClient.updateDataSource(ds, datasetId)
      case _        => Fox.successful(())
    }
    dataSource <- datasetCache.getById(datasetId) ~> NOT_FOUND
  } yield dataSource

@coderabbitai (bot) commented Jul 23, 2025

💡 Verification agent

🧩 Analysis chain

The refreshDataSource method needs careful review.

This critical method handles cache and disk synchronization, but the logic seems complex and potentially racy.

The method:

  1. Gets dataSource from cache
  2. Conditionally reads from disk if it exists
  3. Updates remote WK if disk version exists
  4. Gets dataSource from cache again

This could have race conditions if the cache is invalidated between steps or if multiple concurrent calls occur.


🏁 Script executed:

#!/bin/bash
# Check for other similar cache refresh patterns in the codebase
echo "=== Looking for similar cache refresh patterns ==="
rg -A 10 -B 5 "getById.*datasetId.*NOT_FOUND"

echo "=== Check for cache invalidation patterns ==="
rg -A 5 -B 5 "invalidateCache"

echo "=== Look for concurrent access patterns around refreshDataSource ==="
rg -A 5 -B 5 "refreshDataSource"

Length of output: 26279


Prevent stale cache hit in refreshDataSource

In DataSourceController.scala (lines 692–711), we call datasetCache.getById both before and after updating the remote data source—but never invalidate the cache—so the final returned DataSource will still reflect the old cached state.

Recommendations:

  • Immediately after
    case Some(ds) => dsRemoteWebknossosClient.updateDataSource(ds, datasetId)
    insert
    _ = datasetCache.invalidateCache(datasetId)
  • This ensures the subsequent
    dataSource <- datasetCache.getById(datasetId) ~> NOT_FOUND
    fetches the fresh, updated record.

Alternatively, if a disk‐loaded ds is present, consider returning that directly instead of re‐reading the cache.

🤖 Prompt for AI Agents
In
webknossos-datastore/app/com/scalableminds/webknossos/datastore/controllers/DataSourceController.scala
around lines 692 to 711, the method refreshDataSource calls datasetCache.getById
before and after updating the remote data source but does not invalidate the
cache, causing the final returned DataSource to be stale. To fix this,
immediately after the line where dsRemoteWebknossosClient.updateDataSource is
called inside the case Some(ds) block, add a call to
datasetCache.invalidateCache(datasetId) to clear the cache. This ensures that
the subsequent datasetCache.getById fetches the updated DataSource.
Alternatively, you could return the disk-loaded ds directly if present instead
of re-reading the cache.

@frcroth (Member, Author) replied:

Right, I forgot the invalidate call

@coderabbitai (bot) replied:

@frcroth Thanks for confirming! Yes, adding the cache invalidation call right after the updateDataSource call will ensure the subsequent cache read gets the fresh data:

_ <- dataSourceFromDir match {
  case Some(ds) => 
    for {
      _ <- dsRemoteWebknossosClient.updateDataSource(ds, datasetId)
      _ = datasetCache.invalidateCache(datasetId)
    } yield ()
  case _ => Fox.successful(())
}

This will prevent the stale cache issue and ensure refreshDataSource returns the updated DataSource.

